SYSTEM, METHOD AND ARTICLE OF MANUFACTURE FOR 
SOFTWARE-DESIGNED INTERNET RECONFIGURABLE HARDWARE 



RELATED APPLICATIONS 

This application claims priority from Provisional U.S. Patent Application entitled 
System, Method, and Article of Manufacture for a User Interface for Transferring 
Configuration Information for a Configuring a Device in Reconfigurable Logic, serial 
number 60/219,753, filed July 20, 2000, and which is incorporated herein by reference 
for all purposes. 



FIELD OF THE INVENTION 

The present invention relates to reconfigurable logic devices and more particularly to 
network-based configuratiomn of a logic device. 



BACKGROUND OF THE INVENTION 

It is well known that software-controlled machines provide great flexibility in that they 
can be adapted to many different desired purposes by the use of suitable software. As 
well as being used in the familiar general purpose computers, software-controlled 
processors are now used in many products such as cars, telephones and other domestic 
products, where they are known as embedded systems. 

However, for a given a function, a software-controlled processor is usually slower than 
hardware dedicated to that function. A way of overcoming this problem is to use a 
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special software-controlled processor such as a RISC processor which can be made to 
function more quickly for limited purposes by having its parameters (for instance size, 
instruction set etc.) tailored to the desired functionality. 

Where hardware is used, though, although it increases the speed of operation, it lacks 
flexibility and, for instance, although it may be suitable for the task for which it was 
designed it may not be suitable for a modified version of that task which is desired 
later. It is now possible to form the hardware on reconfigurable logic circuits, such as 
Field Programmable Gate Arrays (FPGA's) which are logic circuits which can be 
repeatedly reconfigured in different ways. Thus they provide the speed advantages of 
dedicated hardware, with some degree of flexibility for later updating or multiple 
functionality. 

In general, though, it can be seen that designers face a problem in finding the right 
balance between speed and generality. They can build versatile chips which will be 
software controlled and thus perform many different functions relatively slowly, or 
they can devise application-specific chips that do only a limited set of tasks but do them 
much more quickly. 

A compromise solution to these problems can be found in systems which combine both 
dedicated hardware and also software. The hardware is dedicated to particular 
functions, e.g. those requiring speed, and the software can perform the remaining 
functions. The design of such systems is known as hardware-software codesign. 

Within the design process, the designer must decide, for a target system with a desired 
functionality, which functions are to be performed in hardware and which in software. 
This is known as partitioning the design. Although such systems can be highly 
effective, the designer must be familiar with both software and hardware design. It 
would be advantageous if such systems could be designed by people who have 



EMB1P022 



familiarity only with software and which could utilize the flexibility of configurable 
logic resources. Further, it would be advantageous to implement into such systems i 
intuitive, ergonomic interface for selecting and transferring configuration data. 
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SUMMARY OF INVENTION 



A system, method and article of manufacture are provided for network-based 
5 configuration of a programmable logic device. A default application is initiated on a 
programmable logic device. A file request for configuration data from the logic device 
is sent to a server located remotely from the logic device utilizing a network. The 
configuration data is received from the network server, and can be in the form of a 
bitfile. The configuration data is used to configure the logic device to run a second 
10 application. The second application is run on the logic device. 

According to one embodiment of the present invention, the logic device includes one or 
more Field Programmable Gate Arrays (FPGAs). Preferably, a first FPGA receives the 
configuration data and uses that data to configure a second FPGA. The first and second 
15 FPGAs can be clocked at different speeds. 

According to another embodiment of the present invention, the default application and 
the second application are both able to run simultaneously on the logic device. The 
logic device can further include a display screen, a touch screen, an audio chip, an 
20 Ethernet device, a parallel port, a serial port, a RAM bank, a non-volatile memory, 
and/or other hardware components. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The invention will be better understood when consideration is given to the following 
detailed description thereof. Such description makes reference to the annexed drawings 
wherein: 

Figure 1 is a schematic diagram of a hardware implementation of one embodiment of 
the present invention; 

Figure 2 is a flow diagram of a process for providing an interface for transferring 
configuration data to a reconfigurable logic device; 

Figure 3 depicts a display according to an exemplary embodiment of the present 
invention; 

Figure 4 illustrates an illustrative procedure for initiating a reconfigurable logic device 
according to the illustrative embodiment of Figure 3; 

Figure 5 depicts a process for using a reconfigurable logic device to place a call over th 
Internet according to the illustrative embodiment of Figure 3; 

Figure 6 illustrates a process for answering a call over the Internet; 

Figure 7 depicts a configuration screen for setting various parameters of telephony 
functions according to the illustrative embodiment of Figure 3; 

Figure 8A depicts an illustrative screen displayed upon receonfiguration of a 
reconfigurable logic device according to the illustrative embodiment of Figure 3; 
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Figure 8B depicts a process for providing a hardware-based reconfigurable multimedia 
device; 

Figure 9 is a diagrammatic overview of a board of the resource management device 
5 according to an illustrative embodiment of the present invention; 

Figure 10 depicts a JTAG chain for the board of Figure 9; 

Figure 11 shows a structure of a Parallel Port Data Transmission System according to 
10 an embodiment of the present invention; 

Q 

Nl Figure 12 is a flowchart that shows the typical series of procedure calls when receiving 

JE 15 Figure 13 is a flow diagram depicting the typical series of procedure calls when 
^ transmitting data; 

[£ Figure 14 is a flow diagram illustrating several processes running in parallel; 

'ass 

20 Figure 15 is a block diagram of an FPGA device according to an exemplary 
embodiment of the present invention; 

Figure 16 is a flowchart of a process for network-based configuration of a 
programmable logic device; 

25 

Figure 17 illustrates a process for remote altering of a configuration of a hardware 
device; and 

Figure 18 illustrates a process for processing data and controlling peripheral hardware. 

30 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

A preferred embodiment of a system in accordance with the present invention is 
5 preferably practiced in the context of a personal computer such as an IBM compatible 
personal computer, Apple Macintosh computer or UNIX based workstation. A 
representative hardware environment is depicted in Figure 1, which illustrates a typical 
hardware configuration of a workstation in accordance with a preferred embodiment 
having a central processing unit 110, such as a microprocessor, and a number of other 
10 units interconnected via a system bus 112. The workstation shown in Figure 1 includes 
p a Random Access Memory (RAM) 114, Read Only Memory (ROM) 116, an I/O 

sa adapter 1 18 for connecting peripheral devices such as disk storage units 120 to the bus 

112 a user interface adapter 122 for connecting a keyboard 124, a mouse 126, a speaker 

Ul 128, a microphone 132, and/or other user interface devices such as a touch screen (not 

lil 

£ 1 5 shown) to the bus 1 1 2, communication adapter 1 34 for connecting the workstation to a 
communication network (e.g., a data processing network) and a display adapter 136 for 
H* connecting the bus 112 to a display device 138. The workstation also includes a Field 

[q Programmable Gate Array (FPGA) 140 with a complete or a portion of an operating 

P system thereon such as the Microsoft Windows NT or Windows/98 Operating System 

20 (OS), the IBM OS/2 operating system, the MAC OS, or UNIX operating system. Those 
skilled in the art will appreciate that the present invention may also be implemented on 
platforms and operating systems other than those mentioned. 

A preferred embodiment is written using JAVA, C, and the C++ language and utilizes 
25 object oriented programming methodology. Object oriented programming (OOP) has 
become increasingly used to develop complex applications. As OOP moves toward the 
mainstream of software design and development, various software solutions require 
adaptation to make use of the benefits of OOP. A need exists for these principles of 
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OOP to be applied to a messaging interface of an electronic messaging system such that 
a set of OOP classes and objects for the messaging interface can be provided. 

OOP is a process of developing computer software using objects, including the steps of 
analyzing the problem, designing the system, and constructing the program. An object 
is a software package that contains both data and a collection of related structures and 
procedures. Since it contains both data and a collection of structures and procedures, it 
can be visualized as a self-sufficient component that does not require other additional 
structures, procedures or data to perform its specific task. OOP, therefore, views a 
computer program as a collection of largely autonomous components, called objects, 
each of which is responsible for a specific task. This concept of packaging data, 
structures, and procedures together in one component or module is called encapsulation. 

In general, OOP components are reusable software modules which present an interface 
that conforms to an object model and which are accessed at run-time through a 
component integration architecture. A component integration architecture is a set of 
architecture mechanisms which allow software modules in different process spaces to 
utilize each others capabilities or functions. This is generally done by assuming a 
common component object model on which to build the architecture. It is worthwhile 
to differentiate between an object and a class of objects at this point. An object is a 
single instance of the class of objects, which is often just called a class. A class of 
objects can be viewed as a blueprint, from which many objects can be formed. 

OOP allows the programmer to create an object that is a part of another object. For 
example, the object representing a piston engine is said to have a composition- 
relationship with the object representing a piston. In reality, a piston engine comprises 
a piston, valves and many other components; the fact that a piston is an element of a 
piston engine can be logically and semantically represented in OOP by two objects. 
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OOP also allows creation of an object that "depends from" another object. If there are 
two objects, one representing a piston engine and the other representing a piston engine 
wherein the piston is made of ceramic, then the relationship between the two objects is 
not that of composition. A ceramic piston engine does not make up a piston engine. 
Rather it is merely one kind of piston engine that has one more limitation than the 
piston engine; its piston is made of ceramic. In this case, the object representing the 
ceramic piston engine is called a derived object, and it inherits all of the aspects of the 
object representing the piston engine and adds further limitation or detail to it. The 
object representing the ceramic piston engine "depends from" the object representing 
the piston engine. The relationship between these objects is called inheritance. 

When the object or class representing the ceramic piston engine inherits all of the 
aspects of the objects representing the piston engine, it inherits the thermal 
characteristics of a standard piston defined in the piston engine class. However, the 
ceramic piston engine object overrides these ceramic specific thermal characteristics, 
which are typically different from those associated with a metal piston. It skips over the 
original and uses new functions related to ceramic pistons. Different kinds of piston 
engines have different characteristics, but may have the same underlying functions 
associated with it (e.g., how many pistons in the engine, ignition sequences, lubrication, 
etc.). To access each of these functions in any piston engine object, a programmer 
would call the same functions with the same names, but each type of piston engine may 
have different/overriding implementations of functions behind the same name. This 
ability to hide different implementations of a function behind the same name is called 
polymorphism and it greatly simplifies communication among objects. 

With the concepts of composition-relationship, encapsulation, inheritance and 
polymorphism, an object can represent just about anything in the real world. In fact, 
one's logical perception of the reality is the only limit on determining the kinds of 
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things tha, can become objects in object-oriented software. Some typical categones are 

as follows: 

. Objects can represent physical objects, such as automobiles in a traffic-flow 
simulation, electrical components in a circuit-design program, countries m an 
economics model, or aircraft in an air-traffic-control system. 

. Objects can represent elements of the computer-user environment such as 
windows, menus or graphics objects. 

. An object can represent an inventory, such as a personnel file or a table of the 
latitudes and longitudes of cities. 

. An object can represent user-defined data types such as time, angles, and 
complex numbers, or points on the plane. 

With this enormous capability of an object to represent just about any logically 
separable matters, OOP allows the software developer to design and .mplemen, a 
computer program 4a. is a model of some aspects of reality, whether that reality >s a 
physical entity, a process, a system, or a composition of matter. Since the object can 
represent anything, the software developer can create an object which can be used as a 
component in a larger software project in the future. 

If 90%of a new OOP software program consists of proven, existing components made 
from preexisting reusable objects, then only the remaining .0% of me new software 
project has to be written and tested from scratch. Since 90% already came from an 
inventory of extensively tested reusable objects, the potential domain from winch an 
error could originate is 10% of the program. As a result, OOP enables software 
developers to build objects out of other, previously built objects. 

This process closely resembles complex machinery being built out of assemblies and 
sub-assemblies. OOP technology, therefore, makes software engineenng more hke 
hardware engineering in that software is built from existing components, wh,ch are 
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available to the developer as objects. All this adds up to an improved quality of the 
software as well as an increased speed of its development. 

Programming languages are beginning to fully support the OOP principles, such as 
encapsulation, inheritance, polymorphism, and composition-relationship. With the 
advent of the C++ language, many commercial software developers have embraced 
OOP. C++ is an OOP language that offers a fast, machine-executable code. 
Furthermore, C++ is suitable for both commercial-application and systems- 
programming projects. For now, C++ appears to be the most popular choice among 
many OOP programmers, but there is a host of other OOP languages, such as Smalltalk, 
Common Lisp Object System (CLOS), and Eiffel. Additionally, OOP capabilities are 
being added to more traditional popular computer programming languages such as 
Pascal. 

The benefits of object classes can be summarized, as follows: 

Objects and their corresponding classes break down complex programming 
problems into many smaller, simpler problems. 

Encapsulation enforces data abstraction through the organization of data into 
small, independent objects that can communicate with each other. 
Encapsulation protects the data in an object from accidental damage, but allows 
other objects to interact with that data by calling the object's member functions 
and structures. 

Subclassing and inheritance make it possible to extend and modify objects 
through deriving new kinds of objects from the standard classes available in the 
system. Thus, new capabilities are created without having to start from scratch. 
Polymorphism and multiple inheritance make it possible for different 
programmers to mix and match characteristics of many different classes and 
create specialized objects that can still work with related objects in predictable 
ways. 
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Class hierarchies and containment hierarchies provide a flexible mechanism for 
modeling real-world objects and the relationships among them. 
Libraries of reusable classes are useful in many situations, but they also have 
some limitations. For example: 

Complexity. In a complex system, the class hierarchies for related classes can 
become extremely confusing, with many dozens or even hundreds of classes. 
Flow of control. A program written with the aid of class libraries is still 
responsible for the flow of control (i.e., it must control the interactions among 
all the objects created from a particular library). The programmer has to decide 
which functions to call at what times for which kinds of objects. 
Duplication of effort. Although class libraries allow programmers to use and 
reuse many small pieces of code, each programmer puts those pieces together in 
a different way. Two different programmers can use the same set of class 
libraries to write two programs that do exactly the same thing but whose internal 
structure (i.e., design) may be quite different, depending on hundreds of small 
decisions each programmer makes along the way. Inevitably, similar pieces of 
code end up doing similar things in slightly different ways and do not work as 
well together as they should. 

20 Class libraries are very flexible. As programs grow more complex, more programmers 
are forced to reinvent basic solutions to basic problems over and over again. A 
relatively new extension of the class library concept is to have a framework of class 
libraries. This framework is more complex and consists of significant collections of 
collaborating classes that capture both the small scale patterns and major mechanisms 

25 that implement the common requirements and design in a specific application domain. 
They were first developed to free application programmers from the chores involved in 
displaying menus, windows, dialog boxes, and other standard user interface elements 
for personal computers. 
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Frameworks also represent a change in the way programmers think about the interaction 
between the code they write and code written by others. In the early days of procedural 
programming, the programmer called libraries provided by the operating system to 
perform certain tasks, but basically the program executed down the page from start to 
5 finish, and the programmer was solely responsible for the flow of control. This was 
appropriate for printing out paychecks, calculating a mathematical table, or solving 
other problems with a program that executed in just one way. 

The development of graphical user interfaces began to turn this procedural 
programming arrangement inside out. These interfaces allow the user, rather than 
program logic, to drive the program and decide when certain actions should be 
performed. Today, most personal computer software accomplishes this by means of an 
event loop which monitors the mouse, keyboard, and other sources of external events 
and calls the appropriate parts of the programmer's code according to actions that the 
user performs. The programmer no longer determines the order in which events occur. 
Instead, a program is divided into separate pieces that are called at unpredictable times 
and in an unpredictable order. By relinquishing control in this way to users, the 
developer creates a program that is much easier to use. Nevertheless, individual pieces 
of the program written by the developer still call libraries provided by the operating 
system to accomplish certain tasks, and the programmer must still determine the flow of 
control within each piece after it's called by the event loop. Application code still "sits 
on top of the system. 

Even event loop programs require programmers to write a lot of code that should not 
25 need to be written separately for every application. The concept of an application 
framework carries the event loop concept further. Instead of dealing with all the nuts 
and bolts of constructing basic menus, windows, and dialog boxes and then making 
these things all work together, programmers using application frameworks start with 
working application code and basic user interface elements in place. Subsequently, they 



10 



15 



20 



EMB1P022 



- 14- 



build from there by replacing some of the generic capabilities of the framework with the 
specific capabilities of the intended application. 

Application frameworks reduce the total amount of code that a programmer has to write 
from scratch. However, because the framework is really a generic application that 
displays windows, supports copy and paste, and so on, the programmer can also 
relinquish control to a greater degree than event loop programs permit. The framework 
code takes care of almost all event handling and flow of control, and the programmer's 
code is called only when the framework needs it (e.g., to create or manipulate a 
proprietary data structure). 

A programmer writing a framework program not only relinquishes control to the user 
(as is also true for event loop programs), but also relinquishes the detailed flow of 
control within the program to the framework. This approach allows the creation of 
more complex systems that work together in interesting ways, as opposed to isolated 
programs, having custom code, being created over and over again for similar problems. 

Thus, as is explained above, a framework basically is a collection of cooperating classes 
that make up a reusable design solution for a given problem domain. It typically 
includes objects that provide default behavior (e.g., for menus and windows), and 
programmers use it by inheriting some of that default behavior and overriding other 
behavior so that the framework calls application code at the appropriate times. 

There are three main differences between frameworks and class libraries: 
• Behavior versus protocol. Class libraries are essentially collections of behaviors 
that you can call when you want those individual behaviors in your program. A 
framework, on the other hand, provides not only behavior but also the protocol 
or set of rules that govern the ways in which behaviors can be combined, 
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including rules for what a programmer is supposed to provide versus what the 
framework provides. 

• Call versus override. With a class library, the code the programmer instantiates 
objects and calls their member functions. It's possible to instantiate and call 
objects in the same way with a framework (i.e., to treat the framework as a class 
library), but to take full advantage of a framework's reusable design, a 
programmer typically writes code that overrides and is called by the framework. 
The framework manages the flow of control among its objects. Writing a 
program involves dividing responsibilities among the various pieces of software 
that are called by the framework rather than specifying how the different pieces 
should work together. 

• Implementation versus design. With class libraries, programmers reuse only 
implementations, whereas with frameworks, they reuse design. A framework 
embodies the way a family of related programs or pieces of software work. It 
represents a generic design solution that can be adapted to a variety of specific 
problems in a given domain. For example, a single framework can embody the 
way a user interface works, even though two different user interfaces created 
with the same framework might solve quite different interface problems. 

Thus, through the development of frameworks for solutions to various problems and 
programming tasks, significant reductions in the design and development effort for 
software can be achieved. A preferred embodiment of the invention utilizes HyperText 
Markup Language (HTML) to implement documents on the Internet together with a 
general-purpose secure communication protocol for a transport medium between the 
client and the Newco. HTTP or other protocols could be readily substituted for HTML 
without undue experimentation. Information on these products is available in T. 
Bemers-Lee, D. Connoly, "RFC 1866: Hypertext Markup Language - 2.0" (Nov. 1995); 
and R. Fielding, H, Frystyk, T. Berners-Lee, J. Gettys and J.C. Mogul, "Hypertext 
Transfer Protocol - HTTP/1.1 : HTTP Working Group Internet Draft" (May 2, 1996). 
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HTML is a simple data format used to create hypertext documents that are portable 
from one platform to another. HTML documents are SGML documents with generic 
semantics that are appropriate for representing information from a wide range of 
domains. HTML has been in use by the World-Wide Web global information initiative 
5 since 1990. HTML is an application of ISO Standard 8879; 1986 Information 
Processing Text and Office Systems; Standard Generalized Markup Language 
(SGML). 

To date, Web development tools have been limited in their ability to create dynamic 
10 Web applications which span from client to server and interoperate with existing 

computing resources. Until recently, HTML has been the dominant technology used in 
development of Web-based solutions. However, HTML has proven to be inadequate in 
the following areas: 

Poor performance; 
15 • Restricted user interface capabilities; 

Can only produce static Web pages; 

Lack of interoperability with existing applications and data; and 
Inability to scale. 

20 Sun Microsystem's Java language solves many of the client-side problems by: 

• Improving performance on the client side; 

• Enabling the creation of dynamic, real-time Web applications; and 

• Providing the ability to create a wide variety of user interface components. 



25 



With Java, developers can create robust User Interface (UI) components. Custom 
"widgets" (e.g., real-time stock tickers, animated icons, etc.) can be created, and client- 
side performance is improved. Unlike HTML, Java supports the notion of client-side 
validation, offloading appropriate processing onto the client for improved performance. 
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Dynamic, real-time Web pages can be created. Using the above-mentioned custom UI 
components, dynamic Web pages can also be created. 

Sun's Java language has emerged as an industry-recognized language for "programming 
the Internet." Sun defines Java as: "a simple, object-oriented, distributed, interpreted, 
robust, secure, architecture-neutral, portable, high-performance, multithreaded, 
dynamic, buzzword-compliant, general-purpose prpgramming language. Java supports 
programming for the Internet in the form of platform-independent Java applets." Java 
applets are small, specialized applications that comply with Sun's Java Application 
Programming Interface (API) allowing developers to add "interactive content" to Web 
documents (e.g., simple animations, page adornments, basic games, etc.). Applets 
execute within a Java-compatible browser (e.g., Netscape Navigator) by copying code 
from the server to client. From a language standpoint, Java's core feature set is based on 
C++. Sun's Java literature states that Java is basically, "C++ with extensions from 
Objective C for more dynamic method resolution." 

Another technology that provides similar function to JAVA is provided by Microsoft 
and ActiveX Technologies, to give developers and Web designers wherewithal to build 
dynamic content for the Internet and personal computers. ActiveX includes tools for 
developing animation, 3-D virtual reality, video and other multimedia content. The 
tools use Internet standards, work on multiple platforms, and are being supported by 
over 100 companies. The group's building blocks are called ActiveX Controls, small, 
fast components that enable developers to embed parts of software in hypertext markup 
language (HTML) pages. ActiveX Controls work with a variety of programming 
languages including Microsoft Visual C++, Borland Delphi, Microsoft Visual Basic 
programming system and, in the future, Microsoft's development tool for Java, code 
named "Jakarta." ActiveX Technologies also includes ActiveX Server Framework, 
allowing developers to create server applications. One of ordinary skill in the art 
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readily recognizes that ActiveX could be substituted for JAVA without undue 
experimentation to practice the invention. 

A preferred embodiment is written using Handel-C, a programming language developed 
from Handel. Handel was a programming language designed for compilation into 
custom synchronous hardware, which was first described in "Compiling occam into 
FPGAs", Ian Page and Wayne Luk in "FPGAs" Eds. Will Moore and Wayne Luk, pp 
271-283, Abingdon EE & CS Books, 1991, which are herein incorporated by reference. 
Handel was later given a C-like syntax (described in "Advanced Silicon Prototyping in 
a Reconfigurable Environment", M. Aubury, I. Page, D. Plunkett, M. Sauer and J. Saul, 
Proceedings of WoTUG 98, 1998, which is also incorporated by reference), to produce 
various versions of Handel-C. 

Handel-C is a programming language marketed by Celoxica Limited, 7 - 8 Milton Park, 
Abingdon, Oxfordshire, 0X14 4RT, United Kingdom. It enables a software or 
hardware engineer to target directly FPGAs (Field Programmable Gate Array) in a 
similar fashion to classical microprocessor cross-compiler development tools, without 
recourse to a Hardware Description Language, thereby allowing the designer to directly 
realize the raw real-time computing capability of the FPGA. 

Handel-C is designed to enable the compilation of programs into synchronous 
hardware; it is aimed at compiling high level algorithms directly into gate level 
hardware. 

25 The Handel-C syntax is based on that of conventional C so programmers familiar with 
conventional C will recognize almost all the constructs in the Handel-C language. For 
those not skilled in the art, more information about programming with Handel-C is 
provided in the documents entitled "Handel-C User manual," "Handel-C Language 
Reference Manual: version 3," "Handel-C Interfacing to other language code blocks," 
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and "Handel-C Preprocessor Reference Manual," each of which is available from 
Celoxica Limited, 7 - 8 Milton Park, Abingdon, Oxfordshire, 0X14 4RT, United 
Kingdom, and which are herein incorporated by reference in their entirety for all 
purposes. 

Sequential programs can be written in Handel-C just as in conventional C but to gain 
the most benefit in performance from the target hardware its inherent parallelism must 
be exploited. 

Handel-C includes parallel constructs that provide the means for the programmer to 
exploit this benefit in his applications. The compiler compiles and optimizes Handel-C 
source code into a file suitable for simulation or a netlist which can be placed and 
routed on a real FPGA. 

It should be noted that other programming and hardware description languages can be 
utilized as well, such as VHDL. 

Network-Configurable Hardware 

This section will detail the development of a flexible multimedia device according to an 
illustrative embodiment of the present invention using hardware that can be 
reconfigured over a network connection and runs software applications built directly in 
silicon. 

25 The illustrative platform developed for this purpose is called the Multimedia Terminal 
(MMT). It features no dedicated stored program and no Central Processing Unit (CPU). 
Instead, programs are implemented in Field Programmable Gate Arrays (FPGA) which 
are used both to control peripherals and to process data in order to create CPU-like 
flexibility using only reconfigurable logic and a software design methodology. 
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FPGAs can be used to create soft hardware that runs applications without the overhead 
associated with microprocessors and operating systems. Such hardware can be totally 
reconfigured over a network connection to provide enhancements, fixes, or a 
5 completely new application. Reconfigurability avoids obsolescence by allowing the 
flexibility to support evolving standards and applications not imagined when hardware 
is designed. This also allows manufacturers to use Internet Reconfigurable Logic to 
remotely access and change their hardware designs at any time regardless of where the 
units reside. 

10 

The MMT according to one exemplary embodiment of the present invention achieves 
flexible reconfigurability by using two independent one-million gate Xilinx XCV1000 
^ Virtex FPGAs. One of the FPGAs remains statically configured with networking 

N functionality when the device is switched on. The other FPGA is reconfigured with 

ry 15 data provided by the master. The two FPGAs communicate directly via a 36-bit bus 
~? ; with 4 bits reserved for handshaking and two 1 6-bit unidirectional channels as set forth 

=F in U.S. Patent Application entitled SYSTEM, METHOD, AND ARTICLE OF 

q MANUFACTURE FOR DATA TRANSFER ACROSS CLOCK DOMAINS, Attorney 

Docket Number EMB 1 P01 5 and filed concurrently herewith and assigned to common 

*B 20 assignee, and which is incorporated herein by reference for all purposes. The protocol 

D 

y, ensures that reliable communication is available even when the two FPGAs are being 

clocked at different speeds. 



The other components of the MMT are an LCD touch screen, audio chip, 10-Mbps 
25 Ethernet, parallel and serial ports, three RAM banks and a single non-volatile flash 
memory chip. 

FPGA reconfiguration can be performed by using one of two methods. The first 
method implements the Xilinx selectmap programming protocol on the static FPGA 
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which can then program the other. The second method supplies reconfiguration data 
from the network interface or from the flash memory on the MMT. Reconfiguration 
from flash memory is used only to load the GUI for a voice-over-intemet protocol 
(VoIP) telephone into the slave FPGA upon power-up, when an application has 
finished, or when configuration via the network fails. Network-based reconfiguration 
uses the Hypertext Transfer Protocol (HTTP) over a TCP connection to a, server. A text 
string containing a file request is sent by the MMT to the server which then sends back 
the reconfiguration data (a bitfile). 

There has thus been presented a flexible architecture that can run selected applications 
in an FPGA. Now will be described methods ofr writing all those applications and how 
to do it in a reasonable amount of time. Hardware Description Languages (HDL) are 
well-suited to creating interface logic and defining hardware designs with low-level 
timing issues. However, HDL may not be suitable for networking, VoIP, MP3s and 
15 video games. 

To meet the challenges of the system described above, the MMT design can be done 
using Handel-C. It is based on ANSI-C and is quickly learned by anyone that has done 
C software development. Extensions have been put in to support parallelism, variables 
of arbitrary width, and other features familiar in hardware design, but it very much 
targets software design methodologies. Unlike some of the prior art C-based solutions 
that translate C into an HDL, the Handel-C compiler directly synthesizes an EDIF 
netlist that can be immediately placed and routed and put onto an FPGA. 



The default application that runs on the illustrative embodiment of the MMT upon 
power-up is a Voice over Internet Protocol (VoIP) telephone complete with GUI. The 
voice over internet protocol consists of a call state machine, a mechanism to negotiate 
calls, and a Real Time Protocol (RTP) module for sound processing. A combination of 
messages from the GUI and the call negotiation unit are used to drive the state machine. 
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The protocol implemented by the call negotiation unit is a subset of H.323 Faststart 
(including H225 and Q931). This protocol uses TCP to establish a stream-based 
connection between the two IP telephones. The RTP module is responsible for 
processing incoming sound packets and generating outgoing packets sent over UDP. 

5 

Algorithms for protocols such as RTP, TCP, IP and UDP can be derived from existing 
public domain C sources. The source code can be optimized to use features available in 
Handel-C such as parallelism; this is useful for network protocols which generally 
require fields in a packet header to be read in succession and which can usually be 
10 performed by a pipeline with stages running in parallel. Each stage can be tested and 
? simulated within a single Handel-C environment and then put directly into hardware by 

SI generating an EDIF netlist. Further optimizations and tuning can be performed quickly 

nl simply by downloading the latest version onto the MMT over the network. 

LH 

=P 1 5 Because of the flexibility of the architecture and to take advantage of Internet 
n ^configurability, a mixed-bag of applications can be developed that all run in hardware 

if on the MMT. Among them are a fully-functional MP3 player with GUI, several video 

£1 games, and some impressive graphics demonstrations that were all developed using 

K Handel-C. These applications are hosted as bitfiles on a server that supplies these files 

20 upon demand from the user of the MMT over a network connection. 



25 Interface 

In accordance with the invention, an intuitive interface is provided for defining and 
transferring configuration files from a computer to a device in reconfigurable logic 
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Figure 2 is a flow diagram of a process 200 for providing an interface for transferring 
configuration data to a reconfigurable logic device, such as a Field Programmable Gate 
Array (FPGA), Programmable Logic Device (PLD), or Complex Programmable Logic 
Device (CPLD). In operation 202, images are presented on a display connected to a 
reconfigurable logic device. In operation 204, the user is allowed to input a command 
to configure the reconfigurable logic device by selecting one or more of the images. 
The configuration data is transferred from a computer to the reconfigurable logic device 
in operation 206 where it is used to reconfigure the reconfigurable logic device in 
operation 208. 

Other embodiments include a touch sensitive Liquid Crystal Display (LCD), buttons 
presented as bitmapped images to guide a user, interactive configuration of the device 
and its components and provides downloading via the Internet and a wireless network. 

In a preferred embodiment, the reconfigurable logic device is capable of saving the 
configuration data for later reuse. In another embodiment, the display is operable for 
inputting commands to control operation of the reconfigurable logic device. 

Example 1 

Figure 3 depicts a display 300 according to one embodiment of the present invention. 
The display is connected to a reconfigurable logic device, such as the one described 
below with respect to Figures 9-15. As an option, the display could be integrated with 
the device. 

An exemplary procedure 400 for initiating the device is shown in Figure 4. The device 
is connected to a network in operation 402 and a power source in operation 404. The 
display is calibrated in operation 406. In operation 408, on connecting power, the 
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device boots with a default programming. In this example, the device boots as an IP 
phone, ready to accept/receive calls. 

Referring again to Figure 3, the display includes several bitmapped buttons with which 
5 a user can input commands for use during a session of Internet telephony. Keypad 
buttons 302 are used to enter IP addresses to place a call. The status window 304 
displays the status of the device. 

In accordance with the present invention, a hardware-based reconfigurable Internet 
10 telephony system can be provided. The system includes a first Field Programmable 
Gate Array (FPGA) that is configured with networking functionality. A user interface 
is in communication with the first FPGA for presenting information to a user and 
receiving commands from a user. A microphone in communication with the first FPGA 
receives voice data from the user. A communications port is in communication with the 
15 first FPGA and the Internet. The first FPGA is configured to provide a call state 
machine, a call negotiation mechanism, and a Real Time Protocol (RTP) module for 
sound processing. See the discussion relating to Figures 5-7 for more detailed 
information about how to place a call. 

20 According to one embodiment of the present invention, a stream-based connection is 
generated between the system and another Internet telephony system. In another 
embodiment of the present invention, a second FPGA is configured for running a 
second application. In such an embodiment, the first FPGA can preferably configure 
the second FPGA. 



25 



In an embodiment of the present invention, the RTP module processes incoming sound 
packets and generates outgoing sound packets. In a preferred embodiment, the user 
interface includes a touch screen. 
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Figure 5 depicts a process 500 for using the device to place a call. (The process flow is 
from top to bottom.) The number key is pressed and then the IP address to be called is 
entered. As the numbers are typed, they appear in the status window. Once the number 
is entered, the accept button 306 is pressed to make the connection. The word "calling" 
appears in the status window to denote that the connection is pending. Upon making 
the connection, "connected" appears in the status window. To end the call, the end 
button 308 is pressed. 

Figure 6 illustrates the process 600 to answering a call. The status window displays 
"incoming call" and the device may sound a tone. The user selects the accept button to 
answer the call. Selection of the end button terminates the call. 

Figure 7 depicts a configuration screen 700 for setting various parameters of the 
telephony functions. The buttons 702, 704 having the plus and minus signs are used to 
increase and decrease speaker volume, microphone volume, etc. Mute buttons 706 and 
display brightness buttons 708. 

One skilled in the art will recognize that the device operates much like a traditional 
telephone and therefore, can include many of the features found in such telephones. 

The screen shown in Figure 3 includes several buttons other than those discussed above. 
Selecting the mp3 button 310 initiates a download sequence ordering the device to 
request configuration information to reconfigure the device to play audio in the mp3 
format. Once the configuration information is received, the device reconfigures itself to 
play mp3 audio. 

Upon reconfiguration, the display presents the screen 800 shown in Figure 8A. The 
various buttons displayed include a play button 802, a stop button 804, track back and 
track forward buttons 806, 808, a pause button 810, a mute button 812, volume up and 
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down buttons 814, 816 and an exit button 818 that returns to the default program, in this 
case, the IP telephony program. 

Upon selection of the saver button 820, the configuration information is stored for 
reconfiguration of the device without requiring a download, if the device has access to 
sufficient storage for the information. 

Referring again to Figure 3, selection of the game button 312 initiates a download 
sequence ordering the device to request configuration information to reconfigure the 
device to allow playing of a game. 

Multimedia Device 

Figure 8B depicts a process 850 for providing a hardware-based reconfigurable 
multimedia device. In operation 852, a default multimedia application is initiated on a 
reconfigurable multimedia logic device, which can be a device similar to that discussed 
with respect to Figures 9-15. A request for a second multimedia application is received 
from a user in operation 854. Configuration data is retrieved from a data source in 
operation 856, and, in operation 858, is used to configure the logic device to run the 
second multimedia application. In operation 860, the second multimedia application is 
run on the logic device. 

According to the present invention, the multimedia applications can include an audio 
application, a video application, a voice-based application, a video game application, 
and/or any other type of multimedia application. 

In one embodiment of the present invention, the configuration data is retrieved from a 
server located remotely from the logic device utilizing a network such as the Internet. 
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In another embodiment of the present invention, the logic device includes one or more 
Field Programmable Gate Arrays (FPGAs). Ideally, a first FPGA receives the 
configuration data and uses the configuration data to configure a second FPGA. 
Another embodiment of the present invention includes first and second FPGAs that are 
clocked at different speeds. In a preferred embodiment, the default multimedia 
application and the second multimedia application are both able to run simultaneously 
on the logic device, regardless of the number of FPGAs. 

Illustrative Reconfigurable Logic Device 

A reconfigurable logic device according to a preferred embodiment of the present 
invention includes a bi-directional 16 bit communications driver for allowing two 
FPGAs to talk to each other. Every message from one FPGA to the other is preceded by 
a 16 bit ID, the high eight bits of which identify the type of message (AUDIO, FLASH, 
RECONFIGURATION etc. . .) and the low identify the particular request for that 
hardware (FLASH_READ etc. . .). The id codes are processed in the header file 
fpOserver.h, and then an appropriate macro procedure is called for each type of message 
(e.g. for AUDIO AudioRequest is called) which then receives and processes the main 
body of the communication. 

Preferably, the FPGAs are allowed to access external memory. Also preferably, 
arbitration is provided for preventing conflicts between the FPGAs when the FPGAs 
access the same resource. Further, the need to stop and reinitialize drivers and hardware 
when passing from one FPGA to the other is removed. 

As an option, shared resources can be locked from other processes while 
communications are in progress. This can include communications between the FPGAs 
and/or communication between an FPGA and the resource. 
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In one embodiment of the present invention, an application on one of the FPGAs is 
allowed to send a command to another of the FPGAs. In another embodiment of the 
present invention, one or more of the FPGAs is reconfigured so that it can access the 
resource. 

In use, the server process requires a number of parameters to be passed to it. These are: 

• PID: Used for locking shared resources (such as the FLASH) from other 
processes while communications are in progress. 

• usendCommand, uSendLock: A channel allowing applications on FPO to send 
commands to applications on FP1 and a one-bit locking variable to ensure the 
data is not interleaved with server-sent data. 

• uSoundOut, uSoundlh: Two channels mirroring the function of the audio 
driver. Data sent to uSoundOut will be played (assuming the correct code in 
FP1) out of the MMT2000 speakers, and data read from uSoundln is the input to 
the MMT2000 microphone. The channels are implemented in such a way that 
when the sound driver blocks, the communication channel between FPGAs is 
not held up. 

• MP3Run: A one bit variable controlling the MP3 GUI. The server will activate 
or deactivate the MP3 GUI on receipt of commands from FP1 . 

• ConfigAddr: A 23 bit channel controlling the reconfiguration process. When 
the flash address of a valid FPGA bitfile is sent to this channel, the server 
reconfigures FP1 with the bitmap specified. 
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The data transfer rate between the two FPGAs in either direction is preferably about If 
bits per 5 clock cycles (in the clock domain of the slowest FPGA), for communicating 
between FPGAs that may be running at different clock rates. 

Several Handel-C macros which may be generated for use in various implementations 
of the present invention are set forth in Table 1. The document "Handel-C Language 
Reference Manual: version 3," incorporated by reference above, provides more 
information about generating macros in Handel-C. 



Table 1 


Filename 


Type 


Macro Name 


Purpose 


FpOserver.h 


Resource server 


FpOserverO 


Resource server for FPO for the 








MMT2000 IPPhone/MP3 








project 


Audiorequest.h 


Audio Server 


AudioRequestO 


Audio server for allowing 






sharing of sound hardware 


Flashrequest.h 


Data server 


FlashRequestO 


Server for allowing FP1 access 








to the FLASH memory 


Mp3request.h 


MP3 server 


MP3Request() 


Server to control the MP3 






application and feed it MP3 








bitstream data when requested. 


Reconfigurerequest.h 


Reconfiguration 


Reconfigurereq 


Allows FP1 to request to be 




hardware 


uestO 


reconfigured, at an application 








exit. 


Fpgacomms.h 


Communications 


FpgacommsO 


Implements two unidirectional 




hardware 




16 bit channels for 








communicating between the 








two FPGAs 
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Illustrative Device Development Platform 

Figure 9 is a diagrammatic overview of a board 900 of the resource management device 
according to an illustrative embodiment of the present invention. It should be noted that 
the following description is set forth as an illustrative embodiment of the present 
invention and, therefore, the various embodiments of the present invention should not 
be limited by this description. As shown, the board can include two Xilinx Virtex™ 
2000e FPGAs 902, 904, an Intel StrongARM SA1 1 10 processor 906, a large amount of 
memory 908, 910 and a number of I/O ports 912. Its main features are listed below: 

Two XCV 2000e FPGAs each with sole access to the following devices: 
Two banks (1 MB each) of SRAM (256Kx32 bits wide) 
Parallel port 
Serial port 
ATA port 

The FPGAs share the following devices: 
VGA monitor port 
Eight LEDs 

2 banks of shared SRAM (also shared with the CPU) 
USB interface (also shared with the CPU) 

The FPGAs are connected to each other through a General Purpose I/O (GPIO) bus, a 
32 bit SelectLink bus and a 32 bit Expansion bus with connectors that allow external 
devices to be connected to the FPGAs. The FPGAs are mapped to the memory of the 
StrongARM processor, as variable latency I/O devices. 

The Intel StrongARM SA1 110 processor has access to the following: 
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64Mbytes of SDRAM 
16Mbytes of FLASH memory 
LCD port 
IRDA port 
Serial port 

It shares the USB port and the shared SRAM with the FPGAs. 

In addition to these the board also has a Xilinx XC95288XL CPLD to implement a 
number of glue logic functions and to act as a shared RAM arbiter, variable rate clock 
generators and JTAG and MultiLinx SelectMAP support for FPGA configuration. 

A number of communications mechanisms are possible between the ARM processor 
and the FPGAs. The FPGAs are mapped into the ARM's memory allowing them to be 
accessed from the ARM as through they were RAM devices. The FPGAs also share two 
1 MB banks of SRAM with the processor, allowing DMA transfers to be performed. 
There are also a number of direct connections between the FPGAs and the ARM 
through the ARM's general purpose I/O (GPIO) registers. 

The board is fitted with 4 clocks, 2 fixed frequency and 2 PLLs. The PLLs are 
programmable by the ARM processor. 

The ARM is configured to boot into Angel, the ARM onboard debugging monitor, on 
power up and this can be connected to the ARM debugger on the host PC via a serial 
link. This allows applications to be easily developed on the host and run on the board. 

There are a variety of ways by which the FPGAs can be configured. These are: 

• By an external host using JTAG or MultiLinx SelectMAP 

• By the ARM processor, using data stored in either of the Flash RAMs or data 
acquired through one to the serial ports (USB, IRDA or RS232). 
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• By the CPLD from power-up with data stored at specific locations in the FPGA 
FlashRAM. 

• By one of the other FPG As. 

Appendices A and B set forth the pin definition files for the master and slave FPGAs on 
the board. Appendix C describes a parallel port interface that gives full access to all the 
parallel port pins. Appendix D discusses a macro library for the board of the present 
invention. 

StrongARM 

The board is fitted with an Intel SA1 110 Strong ARM processor. This has 64Mbytes of 
SDRAM connected to it locally and 16Mbytes of Intel StrataFLASH™ from which the 
processor may boot. The processor has direct connections to the FPGAs, which are 
mapped to its memory map as SRAM like variable latency I/O devices, and access to 
various I/O devices including USB, IRDA, and LCD screen connector and serial port. It 
also has access to 2MB of SRAM shared between the processor and the FPGAs. 

Memory Map 

The various devices have been mapped to the StrongARM memory locations as shown 
in Table 2: 



Table 2 


Address Location 


Contents 


0x00000000 


Flash Memory 16 MB 16 bits wide. 


0x08000000 


CPLD see CPLD section for list of registers 


0x10000000 


Shared RAM bank 1 256K words x32 


0x18000000 


Shared RAM bank 0 256K words x32 
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0x40000000 


FPGA access ( nCS4) 


0x48000000 


FPGA access (nCS5) 


OxCOOOOOOO 


SDRAM bank 0 


OxDOOOOOOO 


SDRAM bank 1 



The suggested settings for the StrongARM's internal memory configuration registers 
are shown in Table 3: 

5 

Table 3 





Register 


Value | 




MDCNFG 


0xA165 A165 




MDREF 


Ox 8230 02E1 


w 


MDCADS0 


Ox 5555 5557 




MDCAS1 


Ox 5555 5555 


E : 
ps 


MDCAS2 


Ox 5555 5555 


as - 


MSC0 


0x2210 4B5C 


O 


MSC1 


Ox 0009 0009 




MSC2 


Ox 2210 2210 



Where the acronyms are defined as: 

MDCNFG - DRAM configuration register 

MSC0,1,2 - Static memory control registers for banks 0,1,2 

MDREF -DRAM refresh control register 

MDCAS - CAS rotate control register for DRAM banks 
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The CPU clock should be set to 191.7MHz (CCF = 9). Please refer to the StrongARM 
Developers Manual, available from Intel Corporation, for further information on how i 
access these registers. 



FLASH memory 

The Flash RAM is very slow compared to the SRAM or SDRAM. It should only be 
used for booting from; it is recommended that code be copied from Flash RAM to 
SDRAM for execution. If the StrongARM is used to update the Flash RAM contents 
then the code must not be running from the Flash or the programming instructions in the 
Flash will get corrupted. 



SDRAM 

A standard 64MB SDRAM SODIMM is fitted to the board and this provides the bulk of 
the memory for the StrongARM. Depending upon the module fitted the SDRAM may 
not appear contiguous in memory. 

Shared RAM banks 

These RAM banks are shared with both FPGAs. This resource is arbitrated by the 
CPLD and may only be accessed once the CPLD has granted the ARM permission to do 
so. Requesting and receiving permission to access the RAMs is carried out through 
CPLD register 0x10. Refer to the CPLD section of this document for more information 
about accessing the CPLD and its internal registers from the ARM processor. See 
Appendix D. 



FPGA access 

The FPGAs are mapped to the ARM's memory and the StrongARM can access the 
FPGAs directly using the specified locations. These locations support variable length 
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accesses so the FPGA is able to prevent the ARM from completing the access until the 
FPGA is ready to receive or transmit the data. To the StrongARM these will appear as 
static memory devices, with the FPGAs having access to the Data, Address and Chip 
Control signals of the RAMs. 

The FPGAs are also connected to the GPIO block of the processor via the SAIO bus. 
The GPIO pins map to the SAIO bus is shown in Table 4. 



Table 4 



GPIO pins 


SAIO lines 


0,1 


0,1 


10,11 


2,3 


17-27 


4-14 



Of fliese SAIO[0:10] connect to the FPGAs and SAIO[0:14] connect to connector O 
on the board. The FPGAs and ARM are also able to access 2MB of shared memory, 
allowing DMA transfers between the devices to be performed. 

I/O Devices 

The following connectors are provided: 

• LCD Interface connector with backlight connector 

• IRDA connector (not 5V tolerant) 

• GPIO pins (not 5V tolerant) 

• Serial port 

• Reset button to reboot the StrongARM 
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The connections between these and the ARM processor are defined below in Tables 5- 
8: 



Table 5: ARM - LCD connections (CN27) 





LCD connector 
pin no. 


ARM pin 


Description 




10..6 


LCD0..4 


BLUE0..4 




18..16 


LCD5..7 


GREEN0..2 




15..13 


GPI02..GPI04 


GREEN3..5 




24..20 


GPI05..GPI09 


RED0..RED4 




27 


LCDFCLK 


16 




28 


LCDLCLK 


17 


-4 

FU 


29 


LCD_PCLK 


18 


m 
y 


4 


LCD_BIAS 


19 




2,3, (1) 




+5V 


E t 


(1), 5,11,12,19, 




GND 


=a : 


25,26,30 







E im c 

Table 6: ARM IRDA connections (CN8A) 



IRDA 




connector pin 


ARM pin 


Description 


no. 




2 


RxD2 




1 


TxD2 




3 


GPI012 




4 


GPI013 




5 


GPI014 
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6,8 




GND 


7 




+3.3V 



Table 7: ARM GPIO - CN20AP connections 



CN20AP pin no. 


GPIO pins 


2,3 


0,1 


4,5 


10, 11 


6-16 


17-27 


17,19 


+3.3V 


18,20 


GND 



Table 8: ARM - Serial Port connections (CN23) 



Serial Port 
connector pin no. 


ARM pin 


Description 


2 


RxDl 




8 


RxD3 




3 


TxDl 




7 


TxD3 




1,4,6,9 




Not connected 


5 




GND 



The serial port is wired in such away that two ports are available with a special lead if 
handshaking isn't required. 

Angel 
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Angel is the onboard debug monitor for the ARM processor. It communicates with the 
host PC over the serial port (a null modem serial cable will be required). The ARM is 
setup to automatically boot into Angel on startup - the startup code in the ARM's Flash 
RAM will need to be changed if this is not required. 

When Angel is in use 32MBs of SDRAM are mapped to 0x00000000 in memory and 
are marked as cacheable and bufferable (except the top 1MB). The Flash memory is 
remapped to 0x40000000 and is read only and cacheable. The rest of memory is 
mapped one to one and is not cacheable or bufferable. 

O Under Angel it is possible to run the FPGA programmer software which takes a bitfile 

S from the host machine and programs the FPGAs with it. As the .bit files are over 1MB 

=1 in size and a serial link is used for the data transfer this is however a very slow way of 

ry 

^1 configuring the FPGAs. 

£ 15 

L Virtex FPGA's 



10 



Two Virtex 2000e FPGAs are fitted to the board. They may be programmed from a 
variety of sources, including at power up from the FLASH memory. Although both 
20 devices feature the same components they have different pin definitions; Handel-C 
header files for the two FPGAs are provided. 

One of the devices has been assigned 'Master', the other 'Slave'. This is basically a 
means of identifying the FPGAs, with the Master having priority over the Slave when 
25 requests for the shared memory are processed by the CPLD. The FPGA below the serial 
number is the Master. 

One pin on each of the FPGAs is defined as the Master/Slave define pin. This pin is 
pulled to GND on the Master FPGA and held high on the Slave. The pins are: 
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MasterFPGA:C9 
Slave FPGA: D33 

The following part and family parameters should be used when compiling a Handel-C 
program for these chips: 

set family = Xilinx4 000E; 
set part = "XV2000e-6-fg680" ; 

Clocks 

Two socketed clock oscillator modules may be fitted to the board. CLKA is fitted with a 
50 MHz oscillator on dispatch and the CLKB socket is left to be fitted by the user 
should other or multiple frequencies to required. A +5 V oscillator module should be 
used for CLKB. 

Two on board PLLs, VCLK and MCLK, provide clock sources between 8MHz and 
100MHz (125MHz may well be possible). These are programmable by the ARM 
processor. VCLK may also be single stepped by the ARM. 

This multitude of clock sources allows the FPGAs to be clocked at different rates, or to 
let one FPGA have multiple clock domains. 

The clocks are connected to the FPGAs, as described in Table 9 and Appendices A and 
B: 

Table 9 



Clock Master FPGA Slave FPGA 
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pin 


pin | 


CLKA 


A20 


D21 


CLKB 


D21 


A20 


VCLK 


AW19 


AU22 


MCLK 


AU22 


AW19 



Programming the FPGAs 

The FPGAs may be programmed from a variety of sources: 

• Parallel III cable JTAG 

• MultiLinx JTAG 

• MultiLinx SelectMAP 

• ARM processor 

• From the other FPGA 

. Power up from FLASH memory ( FPGA FLASH memory section). 

When using any of the JTAG methods of programming the FPGAs you must ensure that 
the Bitgen command is passed the option ««-g startupclk:jtagclk ". You will also need a 
jed file for the CPLD or a .bsd file, which may be found in 

»Xilinx\xc9500xl\data\xc95288XL_tql44.bsd». The StrongARM also requires a .bsd 
file which may be found on the Intel website http VMpvHo perintel.com/design/ 
c^n g /h s H1/ sa 1110 bl.bsd. When downloaded this file will contain HTML headers and 
footers which will need to be removed first. Alternatively, copies of the required .bsd 
files are included on the supplied disks. 

The JTAG chain 1000 for the board is shown in Figure 10. 
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The connections when using the Xilinx Parallel IE cable and the 'JTAG Programmer' 
are set forth in Table 10: 



Table 10: Parallel HI Cable JTAG 



CN24 pin number 


JTAG Connector | 


i 


TMS 


2 


cut pin 


3 


TDI 


4 


TDO 


5 


not used 


6 


TCK 


7 


not used 


8 


GND 


9 


POWER 



With the Xilinx cables it may be easier to fit the flying ends into the Xilinx pod so that a 
number of cables may be connected to the board in one go. 

MultiLinx JTAG 

The board has support for programming using MultiLinx. CN3 is the only connector 
required for JTAG programming with MultiLinx and is wired up as described in Table 
11. (Note that not used signals may be connected up to the MultiLinx if required.) 



Table 11 



CN3 pin number 


MultiLinx 


i 


not used 


3 


RD(TDO) 


5 


not used 


7 


not used 


9 


TDI 



CN3 pin number 


MultiLinx 


2 


Vcc 


4 


GND 


6 


not used 


8 


not used 


10 


not used 
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11 


TCK 


13 


TMS 


15 


not used 


17 


not used 


19 


not used 



12 


not used 


14 


not used 


16 


not used 


18 


not used 


20 


not used 



MultiLinx SelectMAP 



FU 
m 



10 



JP3 must be fitted when using MulitLinx SelectMap to configure the FPGAs. This link 
prevents the CPLD from accessing the FPGA databus to prevent bus contention. This 
also prevents the ARM accessing the FPGA Flash memory and from attempting FPGA 
programming from power up. Connectors CN3 and CN4 should be used for Master 
FPGA programming and CN10 and CN1 1 for programming the Slave FPGA. See 
Tables 12-13. 



Table 12 



CN3/CN10pin 


MultiLinx 


number 




1 


not used 


3 


not used 


5 


not used 


7 


not used 


9 


not used 


11 


not used 


13 


not used 


15 


not used 


17 


not used 


19 


not used 



CN3/CN10pin 


MultiLinx 


number 




2 


+3v3 


4 


GND 


6 


not used 


8 


CCLK 


10 


DONE 


12 


not used 


14 


nPROG 


16 


nINIT 


18 


not used 


20 


not used 



EMB1P022 



-43- 



Table 13 



CN4/CN11 pin 


MultiLinx 




CN4/CN11 pin 


MultiLinx 


number 






number 




1 


cso 




2 


DO 


3 


not used 




4 


Dl 


5 


not used 




6 


D2 


7 


not used 




8 


D3 


9 


not used 




10 


D4 


11 


not used 




12 


D5 


13 


RS 

(RDWR) 




14 


D6 


15 


not used 




16 


D7 


17 


DY/BUSY 




18 


not used 


19 


not used 




20 


not used 



In practice MultiLinx SelectMap was found to be a very tiresome method of 
programming the FPGAs due to the large number of flying leads involved and the fact 
that the lack of support for multi FPGA systems means that the leads have to connected 
to a different connector for configuring each of the FPGA. 

ARM processor 

The ARM is able to program each FPGA via the CPLD. The FPGAs are set up to be 
configured in SelectMap mode. Please refer to the CPLD section of this document and 
Xilinx Datasheets on Virtex configuration for more details of how to access the 
programming pins of the FPGAs and the actual configuration process respectively. An 
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ARM program for configuring the FPGAs with a .bit file from the host PC under Angel 
is supplied. This is a very slow process however as the file is transferred over a serial 
link. Data could also be acquired from a variety of other sources including USB and 
IRDA or the onboard Flash RAMs and this should allow an FPGA to be configured in 
under 0.5 seconds. 

Configuring one FPGA from the other FPGA 

One FPGA is able to configure the other through the CPLD in a manner similar to when 
the ARM is configuring the FPGAs. Again, please refer to the CPLD section of this 
document and the Xilinx data sheets for more information. 

Configuring on power up from Flash Memory 

The board can be set to boot the FPGAs using configuration data stored in this memory 
on power up. The following jumpers should be set if the board is required to boot from 
the Flash RAM: 

• JP1 should be fitted if the Master FPGA is to be programmed from power up 

• JP2 should be fitted if the Slave FPGA is to be programmed from power up. 

If these jumpers are used the Flash RAM needs to be organized as shown in Table 14: 

Table 14 



Open 


Open 


All of FLASH memory available for FLASH 






data 


Fitted 


Open 


Master FPGA configuration data to start at 






address 0x0000 


Open 


Fitted 


Slave FPGA configuration data to start at 
. 
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in 



10 







address 0x0000 


Fitted 


Fitted 


Master FPGA configuration data to start at 
address 0x0000 followed by slave FPGA 
configuration data. 



The configuration data must be the configuration bit stream only, not the entire .bit file. 
The .bit file contains header information which must first be stripped out and the bytes 
of the configuration stream as stored in the .bit file need to be mirrored - i.e. a 
configuration byte stored as 001 10001 in the bit file needs to be applied to the FPGA 
configuration data pins are 10001 100. 

For more information on configuration of Xilinx FPGAs and the .bit format refer to the 
appropriate Xilinx datasheets. 

FPGA FLASH Memory 



15 



16 MB of Intel StrataFLASH ™ Flash memory is available to the FPGAs. This is 
shared between the two FPGAs and the CLPD and is connected directly to them. The 
Flash RAM is much slower than the SRAMs on the board, having a read cycle time of 
120ns and a write cycle of around 80ns. 



20 



The FPGAs are able to read and write to the memory directly, while the ARM processor 
has access to it via the CPLD. Macros for reading and writing simple commands to the 
Flash RAM's internal state machine are provided in the klib.h macro library (such as 
retrieving identification and status information for the RAM), but it is left up to the 
developer to enhance these to implement the more complex procedures such as block 
programming and locking. The macros provided are intended to illustrate the basic 
mechanism for accessing the Flash RAM. 



25 
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When an FPGA requires access to the Flash RAM it is required to notify the CLPD by 
setting the Flash Bus Master signal low. This causes the CPLD to tri-state its Flash 
RAM pins to avoid bus contention. Similarly, as both FPGAs have access to the Flash 
RAM over a shared bus, care has to be taken that they do not try and access the memory 
at the same time (one or both of the two FPGAs may be damaged if they are driven 
against each other). It is left up to the developer to implement as suitable arbitration 
system if the sharing of this RAM across both FPGAs is required. 

The connections between this RAM and the FPGAs are set forth in Table 15: 



Table 15 



Flash RAM pin 


Master FPGA 


Slave FPGA pin 


Fnn 


C.7 


C.7 


FD1 


P4 


P4 


FD9 


PT 


P^ 


Fro 


R1 


R1 


FD4 


Am 


Aro 


Fro 


AG9 


AG? 


Fro 


AH1 


AH1 


FD7 


AR4 


AR4 


FDR 


R91 


A91 


FDQ 


r.?.i 


r.?i 


Fnm 


A9.1 


R91 


FD1 1 


F.9.9 


rm 


FD19. 


R?fl 


A 7.7 


fdh 


r>?? 


F.9.1 


FD14 


r.9.1 


R9? 


FD1S 


R1Q 


R94 


FAD 


A9R 


A14 


FA1 


C7R 


Dlfi 


FA?. 


mi 


R1S 


FAT 


T)7.1 


r.ifi 


FA4 


A 77 


A 1 5 


FAS 


r.9.7 


F.I 7 


FAfi 


R9fi 


R1fi 


FA7 




D17 


FAR 




CI 7 


FAQ 


A7.fi 


A16 


FA10 


D7.S 


F.I 8 
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FA11 


R25 


R17 


FA17 


r.?.s 


D18 


FAH 


A?.S 


A17 


FA14 


n?4 


r.18 


FA1S 


A?4 


R18 


FA16 


R7T 


D1Q 


FA17 


r.?4 


A18 


FAIR 


A7T 


T.1Q 


FA1Q 


R74 


R1Q 


FA 90 


R7? 


r.?.i 


FA?1 


F.?1 


n?? 


FA??. 


A 7.7. 


n /.» i 


FA?3 


r>?t 


F.7? 


nr.F. 


r.iQ 


A?3 


nWF. 


A18 


r.?4 


STS 


D1Q 


R71 


nOF. 


R18 


r.?4 


nBYTE 


C18 


B24 


F bus master t>in 


C17 


C26 



% Local SRAM 



Each FPGA has two banks of local SRAM, arranged as 256K words x 32bits. They 



s=s 5 

fit 

[p have an access time of 1 5ns. 

o 



In order to allow single cycle accesses to these RAMs it is recommended that the 
external clock rate is divided by 2 or 3 for the Handel-C clock rate. I.e. include the 
10 following line in your code: 

set clock = external_divide "A20" 2; // or higher 
For an extemal_divide 2 clock rate the RAM should be defined as: 

15 

macro expr sram_local_bankO_spec = 
{ 
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offchip = 1, 

wegate = 1, 

data = DATA_jpins, 

addr = ADDRESS_pins , 

cs = { "E2", "Fl" , "J4", "F2" , 

"H3" } , 

we = { "H4" }, 
oe = { "El" } 

}; 

If the clock is divided by more than 2 replace the wegate parameter with 

westart=2, 
welength=l, 

The connections to these RAMs are as follows: 



Table 16 



SRAM Pin 


Master 


Slave FPGA 


Master 


Slave FPGA 




FPGA 


SRAM 0 


FPGA 


SRAM 1 




SRAM 0 




SRAM 1 




D31 


Wl 


AA39 


AT3 


AR37 


D30 


AB4 


AB35 


AP3 


AR39 


D29 


AB3 


Y38 


AR3 


AR36 


D28 


W2 


AB36 


AT2 


AT38 


D27 


AB2 


Y39 


AP4 


AR38 


D26 


V1 


AB37 


AR2 


AP36 


D25 


AA4 


AA36 


ATI 


AT39 


D24 


V2 


W39 


AN4 


AP37 


D23 


AA3 


AA37 


ARl 


AP38 


D22 


Ul 


W38 


AN3 


AP39 
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D21 



W3 



W37 



AP2 



AN36 
AN38 



D20 



U2 



V39 



AN2 



D19 



W4 



W36 



API 



AN37 
AN39 



D18 



T1 



U39 



AM4 



D17 



V3 



V38 



AN1 



AM36 
AM38 



D16 
D15 
D14 
D13 
D12 
D11 
DIP 
D9 
D8 
D7 
D6 
D5 
D4 
D3 
D2 
D1 
DO 



A17 
A16 
A15 
A14 
A13 
A12 
A11 
AIO 
A9 
A8 
A7 
A6 
A5 
A4 
A3 
A2 
A1 
AO 



T2 

V4 

V5 

IJ3 

R2 

U4 

PI 

U5 

P2 

T3 

N1 

N2 

T4 

Ml 

R3 

M2 

R4 



LI 
L2 
N3 
K1 
N4 
K2 
M3 
J1 
L3 
.12 
L4 
HI 
K3 
H2 
K4 
G1 
G2 
J3 



U38 

V37 

T39 

V36 

T38 

V35 

R39 

U37 

U36 

R38 

U35 

P39 

T37 

P38 

T36 

N39 

N38 



R37 

M39 

R36 

M38 

P37 

L39 

P36 

N37 

L38 

N36 

K39 

M37 

K38 

L37 

.T39 

L36 

J38 

K37 



AM3 

AL4 

AM2 

AL3 

AMI 

AL2 

AL1 

AK4 

AK2 

AK3 

AK1 

A.T4 

A.T1 

A.T3 

AH2 

AJ2 

AH3 



AG1 

AG4 

AF2 

AG3 

AF1 

AF4 

AF3 

AE2 

AE4 

AE' 

AE3 

AD2 

AD4 

API 

AC1 

AB1 

AC5 

AA2 



AM37 

AL36 

AM39 

AL37 

AL38 

AK36 

AL39 

AK37 

AK38 

AJ36 

AK39 

A.T37 

A.T38 

AH37 

AJ39 

AH38 

AH39 

AG38 

AG36 

AG39 

AG37 

AF39 

AF36 

AE38 

AF37 

AF38 

AE39 

AE36 

AD38 

AE37 

AP39 

AP36 

AC38 

AC39 
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CS 


E2.F1, J4, 


J36, H38, 


AB5, AC4, 


A*r>*">o Ann 

AB38, AD37, 




F2, H3 


J37, K36, 


AA1, AC3, 


AB39, AC35, 






H39 


Yl 


AC37 


WE 


H4 


G38 


Y2 


AA38 


OE 


El 


G39 


AC2 


AC36 


D31 


Wl 


AA39 


AT3 


AR37 



Shared SRAM 

Each FPGA has access two banks of shared SRAM, again arranged as 256K words x 
32bits. These have a 16ns access time. A series of quick switches are used to switch 
these RAMs between the FPGAs and these are controlled by the CPLD which acts as 
arbiter. To request access to a particular SRAM bank the REQUEST pin should be 
pulled low. The code should then wait until the GRANT signal is pulled low by the 
CPLD in response. 

The Handel-C code to implement this is given below: 

// define the Request and Grant interfaces for 
the Shared SRAM 

unsigned 1 shared_bankO_request=l ; 
unsigned 1 shared__bankl_request = l ; 

interface bus_out() 

sharedbkOreg (shared_bankO_request ) with 
sram_shared_bankO_reques t_pin ; 
interface bus_out() 

sharedbklreg (shared_bankl_request) with 
sram shared_bankl_request_pin; 
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interface bus_clock_in (unsigned 1) 
shared_bankO_grant ( ) with 
sram_shared_bankO__grant_pin; 
interface bus_clock_in (unsigned 1) 
shared_bankl__grant () with 
sram_shared_bankl_grant_pin ; 

// Access to a shared RAM bank 

{ 

shared__bankO_request=0 ; 

while ( shared_bankO_grant . in) delay; 

} 

// perform accesses .... 
// release bank 
shared_bankO_request=l ; 

The RAMs should be defined in the same manner as the local RAMs. (See above.) 

The connections to the shared RAMs are given in Table 17: 



Table 17 



Shared 
SRAM pin 


Master FPGA 
Shared SRAM 
0 


Slave FPGA 
Shared SRAM 
0 


Master FPGA 
Shared SRAM 1 


Slave FPGA 
Shared 
SRAM 1 


D31 


AA39 


Wl 


AR37 


AT3 


D30 


AB35 


AB4 


AR39 


AP3 


D29 


Y38 


AB3 


AR36 


AR3 


D28 


AB36 


W2 


AT38 


AT2 


D27 


Y39 


AB2 


AR38 


AP4 
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D25 


AAJo 


A A A 

AA4 


Al jy 


A T1 

All 


D24 


■\i /o n 

W39 


V2 




A XT/1 

AN 4 


D23 


A A O *7 

AA37 


A A ^ 

AA3 


a r>o o 
Ar Jo 


A T> 1 

AK1 


D22 


IT TO O 

W38 


T T 1 

Ul 


a "no r\ 

AP39 


A XTO 

AN 3 


D21 


W37 


IT 70 

W3 


A X TO ^ 

AN36 


A T*Oj 

AP2 


D20 


V39 


T TOj 

U2 


A X TO O 

AN 3 8 


A XTOj 

AN2 


D19 


IT 70 f 

W36 


W4 


A XTO *7 

AN 37 


A T% 1 

API 


D18 


U39 


Tl 


A X TO C\ 

AN39 


A \ A A 

AM4 


D17 


V38 


X TO 

V3 


A X jTO S~ 

AM36 


A XT1 

ANl 


D16 


U38 


T2 


A A jTO O 

AM38 


A X jffO 

AM3 


D15 


T TO 1 

V37 


V4 


AM37 


ATA 

AL4 


D14 


T39 


V5 


A T O Z" 

AL36 


A X jfO 

AM2 


D13 


T 70 f 

V36 


T TO 

U3 


A X >f O O 

AM39 


A T O 

AL3 


D12 


T38 


R2 


A T O *7 

AL37 


A X if 1 

AMI 


Dll 


V35 


T T /I 

U4 


A T O O 

AL38 


A T O- 

AL2 


D10 


R39 


Pi 


a Ty o 

AK36 


A T 1 

AL1 


D9 


T TO '"T 

U37 


T TC 

U5 


A T O C\ 

AL39 


A Ty A 

AK4 


D8 


U36 


P2 


A T7~ O *7 

AK37 


A T/" o. 

AK2 


D7 


R38 


TO 

T3 


A If 1 O 

AK38 


AK3 


D6 


T TO ^ 

U35 


XT 1 

Nl 


AJ36 


AT/1 

AK1 


D5 


P39 


N2 


a x/" o r\ 

AK39 


A T/1 

AJ4 


D4 


T37 


T> A 

T4 


A TO *7 

AJ37 


A T 1 

AJ1 


D3 


P38 


X X 1 

Ml 


A TO O 

AJ38 


A TO 

AJ3 


D2 


T36 


R3 


AH37 


AH2 


Dl 


N39 


M2 


A TO f\ 

AJ39 


A TO, 

AJ2 


DO 


N38 


T* A 

R4 


A T TO O 

AH38 


A T TO 

AH3 












A17 


R37 


LI 


AH39 


AGl 


A16 


M39 


T >0 

L2 


AG38 


AG4 


A15 


R36 


X TO 

N3 


A O f 

AG36 


A T^Oj 

AF2 


A14 


a jTO r» 

M38 


Kl 


a o 

AG39 


a r** o 

AG3 


A 1 O 

A13 


P37 


XT/I 

N4 


Au37 


At 1 


A 1 -O 

Al2 


T 

L39 


K2 


Ar39 


Ar4 


All 


P36 


X >fO 

M3 


A CO ^ 

Ar36 


A T70 

Ar3 


A 1 A 

A1U 


Nj / 


T1 
Jl 


A CIO 


A O 


A9 


L38 


L3 


AF37 


AE4 


A8 


N36 


J2 


AF38 


AE1 


A7 


K39 


L4 


AE39 


AE3 


A6 


M37 


HI 


AE36 


AD2 


A5 


K38 


K3 


AD38 


AD4 


A4 


L37 


H2 


AE37 


AD1 
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A2 


L36 


Gl 


AD36 


AW 1 


Al 


J38 


G2 


AC38 


AC5 


AO 


K37 


J3 


AC39 


AA2 


CS 


J36, H39, K36, 


E2,H3,F2,J4,F1 


AC37, AD37, 


AB5, AC3, 




H38, J37 




AB38, AC35. 


Yl, AA1, 








AB39 


AC4 


WF. 


G38 


H4 


AA38 


Y? 


OE 


G39 


El 


AC36 


AC2 


REOUEST 


A17 


A25 


D18 


C25 


GRANT 


B17 


B25 


E18 


D25 



Connections to the StrongARM processor 

The FPGAs are mapped to the StrongARMs memory as variable latency I/O devices, 
and are treated as by the ARM as though they were 1024 entry by 32bit RAM devices. 
The address, data and control signals associated with these RAMs are attached directly 
to the FPGAs. The manner in which the FPGAs interact with the ARM using these 
signals is left to the developer. 

The connections are as shown in Table 18: 



Table 18 



ARM pin 


Master FPGA pin 


Slave FPGA pin 


ARMA9 


A33 


Cll 


ARMAS 


C31 


Bll 


ARMA7 


B32 


C12 


ARMA6 


B31 


All 


ARMA5 


A32 


D13 


ARMA4 


D30 


B12 


ARMA3 


A31 


C13 


ARMA2 


C30 


D14 
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A D\yf A A 












ARMD31 


CIO 


wi 


ARMDiO 


Hi / 


X.T*t 


ARMD29 


T71Q 
no 


L/Z 


ARMD2X 


riio 


ri 


ARMD27 


biy 


LJi 


A Tk X yfT\0 /T 

ARMD26 


lii / 


r4 


ARMD25 


biX 


Ti1 


A T» X jTTA^ A 

ARMD24 


Ct3o 




A T» X yTT"\T5 

ARMD23 


uiy 


A /I 
A4 


ARMD22 


U3X 


Uo 


ARMD2I 


rio 


DJ 


ARMD20 


LJi/ , 


LO 


ARMD19 


bi / 


Aj 


A T* X yfr\ 1 O 

ARMDIX 


CiX 


U / 


A T> A A T\ 1 *7 

ARMD17 


bi / 


DO 


A T> A AT~\ 1 £ 

ARMDIo 


ri / 


/ 


A T> A A"f~\ 1 C 

ARMD15 


IJi j 


A A 
At) 


ARMDI4 


bio 


Uo 


ARMD13 


L,i3 


b / 


A T* X yf T\ 1 O 

ARMD12 


A 'J £ 

Aio 


PQ 


A T> A /fT\ 1 1 

ARMUl l 


Ui4 


A 7 
A / 


ARMDIO 


bi:> 


UV 


ARMD9 


L34 


r> o 
brt 


A "T» X #AO 

ARMD8 


A35 


Aft 


ARMD7 


D33 


PO 


ARMD6 


bJ4 


bv 


ARMD5 


Cii 




A Tl X jTT"\ A 

ARMD4 


A 'J A 

AJ4 


A O 

AV 


ARMD3 


DOT 

bii 


b 1 u 


ARMD2 


Diz 


pi A 


Arvivi YJ \ 




m 1 


ARMDO 


D31 


AIO 








ARMnWE 


A30 


B13 


ARMnOE 


C29 


D15 


ARMnCS4 


A29 


A13 


ARMnCS5 


B29 


C15 


ARMRDY 


B28 


B14 
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Some of the ARM's general purpose I/O pins are also connected to the FPGAs. These 
go through connector CN25 on the board, allowing external devices to be connected to 
them (see also ARM section). See Table 19. 



Table 19 





SAIO bus 


ARM GPI/O 


Master 


Slave 




(ARMGPIO) 


pins 


FPGA pin 


FPGA pin 




SATO 10 


23 


B9 


B34 




SAT09 


22 


D10 


C33 




SAT08 


21 


A9 


A34 




SAT07 


20 


CIO 


D32 




SAT06 


19 


BIO 


B33 


i! I 


SAT05 


18 


D11 


C32 




SAT04 


17 


A10 


D31 




SAI03 


11 


C11 


A33 




SAT02 


10 


B11 


C31 


\ Ft 


SAIOl 


1 


C12 


B32 




SAIO0 


0 


All 


B31 



u 

FU CPLD Interfacing 

^ 10 Listed in Table 20 are the pins used for setting the Flash Bus Master signal and 
FP COMs. Refer to the CPLD section for greater detail on this. 



Table 20 



Bus Master pin 


C17 


C26 


FP_COM pins 


B16, E17, A15 


B26, C27, A27 


[MSB..LSB] 







15 
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Local I/O devices available to each FPGA 
ATA port 

5 33 FPGA I/O pins directly connect to the ATA port. These pins have 100Q series 

termination resistors which make the port 5V 10 tolerant. These pins may also be used 
as I/O if the ATA port isn't required. See Table 21. 



Table 21 





ATA line no. 


ATA port 


Master FPGA 


Slave FPGA pin | 




ATAO 


1 


AV4 


Al 33 




ATA1 


4 


A T TZ!" 

AU6 


AW Jo 




ATA2 


3 


a n i a 

AW4 


AT Til 


L_ I 


ATA3 


6 


AT/ 


AVjj 


fy 


ATA4 


5 


AW5 


A TM 
Al 5l 




ATA5 


8 


a t in 

AU7 


A V171^ 

AW3j 




A T A H 
Al AO 


7 


AVfi 

r\ V \J 


AU32 




ATA7 


10 


AT8 


AV34 




ATA8 


9 


AW6 


AV32 




ATA9 


12 


AU8 


AW34 




ATA10 


11 


AV7 


AT31 




ATA11 


14 


AT9 


AU31 


i | 


ATA12 


13 


AW7 


AV33 




ATAO 


16 


AV8 


AT30 




ATA14 


15 


AU9 


AW33 




ATA15 


18 


AW8 


AU30 




ATA16 


17 


AT10 


AW32 




ATA17 


20 


AV9 


AT29 




ATA18 


21 


AU10 


AV31 




ATA19 


23 


AW9 


AU29 




ATA20 


25 


AT11 


AW31 




ATA21 


28 


AV10 


AV29 




ATA22 


27 


AU11 


AV30 




ATA23 


29 


AW10 


AU28 




ATA24 


31 


AU12 


AW30 




ATA25 


32 


AV11 


AT27 




ATA26 


33 


ATI 3 


AW29 




a t* a *\n 


14 


a \^r^ 1 
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ATA28 


35 


AU13 


AU27 


ATA29 


36 


AT14 


AW28 


ATA30 


37 


AV12 


AT26 


ATA31 


38 


AU14 


AV27 


ATA32 


39 


AW12 


AU26 


GND 


2,19,22, 24, 26, 30,40 



Parallel port 

A conventional 25pin D-type connector and a 26way box header are provided to access 
this port. The I/O pins have 100Q series termination resistors which also make the port 
5V I/O tolerant. These pins may also be used as I/O if the parallel port isn't required. 
See Table 22. See also Appendix C. 



Table 22 



PP line no. 


Parallel port pin 


Master FPGA pin 


Slave FPGA pin 


PPO0 


i 


A8 


A35 


PPOl 


14 


B8 


C34 


PP02 


2 


D9 


B35 


PP03 


15 


A7 


D34 


PP04 


3 


C8 


A36 


PP05 


16 


B7 


C35 


PP06 


4 


D8 


B36 


PP07 


17 


A6 


D35 


PP08 


5 


C7 


F37 


PP09 


6 


B6 


B37 


PPO10 


7 


D7 


C38 


PPOll 


8 


A5 


E37 


PPOl 2 


9 


C6 


D37 


PPOl 3 


10 


B5 


F36 


PPOl 4 


11 


D6 


D38 


PPOl 5 


12 


A4 


D39 


PPOl 6 


13 


C5 


G36 



GND | 18,19,20,21,22,23,24,25 
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Serial port 

A standard 9pin D-type connector with a RS232 level shifter is provided. This port may 
be directly connected to a PC with a Null Modem cable. A box header with 5 V tolerant 
I/O is also provided. These signals must NOT be connected to a standard RS232 
interface without an external level shifter as the FPGAs maybe damaged. See Table 
23. 



Table 23 



Serial line no. 


Serial port pin no. 


Master FPGA pin 


Slave FPGA pin 


Serial 0 (CTS) 


8 (CTS) 


AV3 


AT34 


Serial 1 (RxD) 


2 (RxD) 


AU4 


AU36 


Serial 2 (RTS) 


7 (RTS) 


AV5 


AU34 


Serial 3 (TxD) 


3 (TxD) 


AT6 


AV36 




GND 


5 


Not connected 


1,4,6,9 



Serial Header 

Each FPGA also connects to a 10 pin header (CN9/CN16). The connections are shown 
in Table 24: 



Table 24 



(CN9/CN16) 


Master 


Slave 


Header pin no. 


FPGA pin 


FPGA pin 


1 


D1 


E38 


2 


F4 


G37 


3 


D3 


E39 


4 


F3 


H36 
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5 


D2 


F38 


6 


G4 


H37 


7 


G3 


F39 




8.9 


GND 


10 


+5V 



Shared I/O Devices 

5 These devices are shared directly between the two FPGAs and great care should be 
taken as to which FPGA accesses which device at any given time. 

¥j VGA Monitor 

Ft : 

\ \x 

F! 10 A standard 1 5pin High Density connector with an on-board 4bit DAC for each colour 

ill 

i (Red, Green, Blue) is provided. This is connected to the FPGAs as set forth in Table 25: 

E 

D 

M Table 25 



VGA line 


Master FPGA pin 


Slave FPGA pin 


VGA1 1 fR3) 


AV25 


ATI 6 


VGA10 (KZ) 


AT24 


AW14 


VGA9 (R\) 


AW25 


AU16 


VGA8 (R0^ 


AU24 


AVI 5 


VGA7 (G3) 


AW24 


AR17 


VGA6 CG2) 


AW23 


AW15 


VGA5 (Gn 


AV24 


AT17 


VGA4 (GO) 


AV22 


AU17 


VGA3 fB3) 


AR23 


AV16 


VGA2 fB2) 


AW22 


AR18 


VGA1 (B\) 


AT23 


AW16 


VGAO fBO^ 


AV21 


ATI 8 


VGA13 


AW26 


AW13 


VGA12 


AU25 


AV14 
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LEDs 

Eight of the twelve LEDs on the board are connected directly to the FPGAs. See Table 
26. 



Table 26 



LED 


Master FPGA pin 


Slave FPGA pin 


D5 


AT25 


AU15 


D6 


AV26 


AV13 


D7 


AW27 


ATI 5 


D8 


AU26 


AW12 


D9 


AV27 


AU14 


D10 


AT26 


AV12 


Dll 


AW28 


AT14 


D12 


AU27 


AU13 



GPIO connector 

A 50way Box header with 5V tolerant I/O is provided. 32 data bits ('E' bus) are 
available and two clock signals. The connector may be used to implement a SelectLink 
to another FPGA. +3V3 and +5V power supplies are provided via fuses. See Table 27. 



Table 27 



Expansion 


GPI/O 


Master 


Slave FPGA 


bus line 


header pin 
no. 


FPGA pin 


pin 


E0 


11 


AT15 


AW27 


E1 


13 


AV13 


AV26 


E2 


15 


AU15 


AT25 


E3 


17 


AW13 


AW26 
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to 


zo 


AT1R 

AA I iO 


AV25 


to 


9^ 


AAV V l*t 


AT24 


b/ 


Z / 


Al MR 
nU ID 


AW? 5 

AAV VZ.«J 


bo 


1 1 
J 1 


AVI ^ 
A V 1 J 


ATT94 


PA 

by 


11 


AP 1 7 
Alvl / 


A W04 

A WZ*t 


blU 


1^ 
3!) 


AW/1 ^ 


A WO'* 

A WZj 


ri i 

bll 


17 
3 / 


ATI 7 
/\ 1 1 / 


AV94 

A V Z4 


biz 


41 


ATT1 7 
AU 1 / 


AV99 

A V ZZ 


bl3 


43 


A V1 A 
A V 10 


AP9^ 
AlvZ J 


T7 1 /I 

bl4 


4D 


Alvl 5 


A WOO 

A W ZZ 


bl5 


4/ 


A W 1 A 
AW lO 


AT9^ 

A 1 ZJ 


t? 1 /c 

bio 


44 


ATI C 
Alio 


AV91 
A V Z 1 


bl / 


/lO 
4z 


AV1 7 
AVI/ 


AT T9^ 

AUZ J 


bio 


4U 


AT T1 8 
AU 1 o 


AW91 

/A W Z 1 


rift 

bly 


1Q 
3o 


AW1 7 
A W 1 / 


AV9^ 

A V Zj 


iron 
bzll 


1/1 
3H 


AT1 0 


AR99 


bzl 


3z 


A V 1 o 


AV90 

/A V ZU 


bzz 


in 
3U 


AT T1 0 
AU iy 


AW90 

/A. W ZU 


T701 

bz3 


Zo 


AW1 R 
A W 1 0 


AV19 

/A V 1/ 


bz4 


0/1 
Z4 


AUZ 1 


AT 191 

/AUZ 1 


E25 


22 


AV19 


AW18 


E26 


20 


AW20 


AU19 


E27 


18 


AV20 


AVI 8 


E28 


14 


AR22 


AT19 


E29 


12 


AV23 


AW17 


E30 


10 


AW21 


AU18 


E31 


8 


AU23 


AV17 




CLKA 


5 CCLK 3 on diagrams') 


CLKB 


49 ( CLK 4 on diagrams') 


+5V 


1.2 


+3V3 


3.4 


GND 


6, 7, 9, 16,19, 26, 29, 36, 39, 46, 48, 50 



SelectLink Interface 



EMB1P022 



-62- 



There is another 32bit general purpose bus connecting the two FPGAs which may be 
used to implement a SelectLink interface to provide greater bandwidth between the two 
devices. The connections are set forth in Table 28: 



Table 28 



f 


SelectLink Line 


Master 
FPGA pin 


Slave 
FPGA pin 




SLO 


AV28 


AW11 




SL1 


AW29 


AT13 




SL2 


AT27 


AV11 




SL3 


AW30 


AU12 




SL4 


AU28 


AW10 


iO 


SL5 


AV30 


AU11 


SJ 


SL6 


AV29 


AV10 


~~4 


SL7 


AW31 


AT11 


z : : 

£ -ST 


SL8 


AU29 


AW9 




SL9 


AV31 


AU10 


Ld 


SL10 


AT29 


AV9 


s 


SL11 


AW32 


AT10 


I s 


SL12 


AU30 


AW8 




SL13 


AW33 


AU9 


ru 


SL14 


AT30 


AV8 


— 


SL15 


AV33 


AW7 




SL16 


AU31 


AT9 




SL17 


AT31 


AV7 




SL18 


AW34 


AU8 




SL19 


AV32 


AW6 




SL20 


AV34 


AT8 




SL21 


AU32 


AV6 




SL22 


AW35 


AU7 




SL23 


AT32 


AU8 




SL24 


AV35 


AT7 




SL25 


AU33 


AW4 




SL26 


AW36 


AU6 




SL27 


AT33 


AV4 




SL28 


AV36 


AT6 




SL29 


AU34 


AV5 




SL30 


AU36 


AU4 
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SL31 



AT34 



AV3 



USB 

The FPGAs have shared access to the USB chip on the board. As in the case of the 
Flash RAM, the FPGA needs to notify the CPLD that it has taken control of the USB 
chip by setting the USBMaster pin low before accessing the chip. For more information 
on the USB chip refer to the USB section of this document. 



Table 29 




USBMaster 


D17 


D26 


USBMS 


C16 


D27 


nRST 


B15 


B27 


IRQ 


D16 


C28 


AO 


A14 


A28 


nRD 


B14 


B28 


nWR 


C15 


B29 


nCS 


A13 


A29 


D7 


D15 


C29 


D6 


B13 


A30 


D5 


C14 


D29 


D4 


A12 


B30 


D3 


D14 


C30 


D2 


C13 


A31 


Dl 


B12 


D30 


DO 


D13 


A32 



CPLD 

The board is fitted with a Xilinx XC95288XL CPLD which provides a number of Glue 
Logic functions for shared RAM arbitration, interfacing between the ARM and FPGA 
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and configuration of the FPGAs. The later can be used to either configure the FPGAs 
from power up or when one FPGA re-configures the other (Refer to section 
'Programming the FPGAs'). A full listing of ABEL code contained in the CPLD can be 
found in Appendix D. 

Shared SRAM bank controller 

The CPLD implements a controller to manage the shared RAM banks. A Request - 
Grant system has been implemented to allow each SRAM bank to be accessed by one of 
the three devices. A priority system is employed if more than one device requests the 
SRAM bank at the same time. 



Highest priority : ARM 

Master FPGA 
Lowest priority : Slave FPGA 

The FPGAs request access to the shared SRAM by pulling the corresponding 
REQUEST signals low and waiting for the CPLD to pull the GRANT signals low in 
response. Control is relinquished by setting the REQUEST signal high again. The ARM 
processor is able to request access to the shared SRAM banks via some registers within 
the CPLD - refer to the next section. 



CPLD Registers for the ARM 



The ARM can access a number of registers in the CPLD, as shown in Table 30: 



Table 30 



0x00 This is an address indirection register for register 1 which used 
for the data access. 
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0 Write only FLASH Address A0-A7 

1 Write only FLASH Address A8-A1 5 

2 Write only FLASH Address Al 6-A24 

3 Read / Write FLASH data (Access time must be at least 

150ns) 

14 Backlight brightness 

5 Write Only USB control (RST / MS ) 

DO : USB RESET 

Dl : USB Master Slave 



0x04 



Data for register 0 address expanded data 



0x08 



Master FPGA access 



OxOC 



Slave FPGA access 



0x10 



SRAM Arbiter 

DO: Shared SRAM bank 0 Request ( high to request, low to 
relinquish ) 

Dl: Shared SRAM bank 1 Request ( high to request, low to 
relinquish ) 

D4: Shared SRAM bank 0 Granted ( High Granted Low not 
Granted ) 

D5: Shared SRAM bank 1 Granted ( High Granted Low not 
Granted ) 



0x14 



Status / FPGA control pins ( including PLL control ) 
Write 

DO : Master FPGA nPROGRAM pin 

Dl : Slave FPGA nPROGRAM pin 

D2 : Undefined 

D3 : Undefined 

D4 : PLL Serial clock pin 

D5 : PLL Serial data pin 
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D6 : PLL Feature Clock 




D7 : PLL Internal Clock select 




Read 




DO : Master FPGA DONE Signal 




Dl : Slave FPGA DONE signal 




D2 : FPGA INIT Signal 




D3 : FLASH status Signal 




D4 : Master FPGA DOUT Signal 




D5 : Slave FPGA DOUT Signal 




D6 : USB IRQ Signal 


0x18 


USB Register 0 


OxlC 


USB Register 1 



CPLD Registers for the FPGA 's 

The FPGAs can access the CPLD by setting a command on the FPCOM pins. Data 
transferred on the FPGA (Flash RAM) databus. See Table 31. 



Table 31 




0x0 


Write to Control Register 




DO : Master FPGA Program signal ( inverted) 




Dl : Slave FPGA Program signal (inverted) 




D2 : Master FPGA chip select signal (inverted) 




D3 : Slave FPGA chip select signal (inverted) 


0x3 


Sets configuration clock low 
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0x5 


Read Status Register 

DO : Master FPGA DONE signal 
Dl : Slave FPGA DONE signal 
D2 : FPGA INIT signal 
D3 : FLASH status signal 
D4 : Master FPGA DOUT signal 
D5 : Slave FPGA DOUT signal 
D6 : USB IRQ signal 


0x7 


No Operation 



These commands will mainly be used when one FPGA reconfigures the other. Refer to 
the FPGA configuration section and the appropriate Xilinx datasheets for more 
information. 

CPLDLEDs 

Four LED's are directly connected to the CPLD. These are used to indicate the 
following: 

DO DONE LED for the Master FPGA Flashes during programming 
D 1 DONE LED for the Slave FPGA Flashes during programming 
D2 Not used 

D3 Flashes until an FPGA becomes programmed 
Other Devices 
USB 

The board has a SCAN Logic SL1 1H USB interface chip, capable of full speed 
20 12Mbits/s transmission. The chip is directly connected to the FPGAs and can be 
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accessed by the ARM processor via the CLPD (refer to the CPLD section of this 
document for further information). 

The datasheet for this chip is available at http://www.scanlogic.com/pdfysll lh 
/si Uhspec.pdf 

5 

PSU 

This board maybe powered from an external 12V DC power supply through the 2.1mm 
DC JACK. The supply should be capable of providing at least 2.4A. 

10 

Handel-C Library Reference 

Introduction 

This section describes the Handel-C libraries written for the board. The klib.h library 
15 provides a number of macro procedures to allow easier access to the various devices on 
the board, including the shared memory, the Flash RAM, the CPLD and the LEDs. Two 
other libraries are also presented, parallel_port.h and serial_port.h, which are generic 
Handel-C libraries for accessing the parallel and serial ports and communicating over 
these with external devices such as a host PC. 

20 

Also described is an example program which utilizes these various libraries to 
implement an echo server for the parallel and serial ports. 

Also described here is a host side implementation of ESL's parallel port data transfer 
25 protocol, to be used with the data transfer macros in parallel_port.h. 

The klib.h Library 
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Shared RAM arbitration 

A request - grant mechanism is implemented to arbitrate the shared RAM between the 
two FPGAs and the ARM processor. Four macros are provided to make the process of 
requesting and releasing the individual RAM banks easier. 

KRequestMemoryBankOO; 
KRequestMemoryBankl Q; 
KReleaseMemoryBankOQ; 
KReleaseMemoryBankl (); 

Arguments 

None. 

Return Values 

None. 

Execution Time 

KRequestMemoryBank#0 requires at least one clock cycle. 
KReleaseMemoryBank#() takes one clock cycle. 

Description 

These macro procedures will request and relinquish ownership of their respective 
memory banks. When a request for a memory bank is made the procedure will block the 
thread until access to the requested bank has been granted. 

Note: The request and release functions for different banks may be called in 
parallel with each other to gain access to or release both banks in the same cycle. 
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Flash RAM Macros 

These macros are provided as a basis through which interfacing to the Flash RAM can 
be carried out. The macros retrieve model and status information from the RAM to 
illustrate how the read/write cycle should work. Writing actual data to the Flash RAM is 
more complex and the implementation of this is left to the developer. 

KSetFPGAFBMQ 
KReleaseFPGAFBMO 

Arguments 

None. 

Return Values 

None. 

Execution Time 

Both macros require one clock cycle. 
Description 

Before any communication with the Flash RAM is carried out the FPGA needs to let the 
CPLD know that it is taking control of the Flash RAM. This causes the CLPD to tri- 
state the Flash bus pins, avoiding resource contention. KSetFPGAFBM() sets the Flash 
Bus Master (FBM) signal and KReleaseFPGAFBMO releases it. This macro is 
generally called by higher level macros such as KReadFlashO or KWriteFlashO- 

Note: These two procedures access the same signals and should NOT be called in 
parallel to each other. 



EMB1P022 



-71 - 



KEnableFlashO 
KDisableFlashO 

Arguments 

None. 

Return Values 

None. 

Execution Time 

Both macros require one clock cycle. 
Description 

These macros raise and lower the chip-select signal of the Flash RAM and tri-state the 
FPGA Flash RAM lines (data bus, address bus and control signals). This is necessary if 
the Flash RAM is to be shared between the two FPGAs as only one chip can control the 
Flash at any give time. Both FPGAs trying to access the Flash RAM simultaneously can 
cause the FPGAs to 'latch up 5 or seriously damage the FPGAs or Flash RAM chip. This 
macro is generally called by higher level macros such as KReadFlash() or 
KWriteFlash(). 

Note: These macros access the same signals and should NOT be called in parallel 
with each other. 

KWriteFlash(address, data) 
KReadFlash(address t data) 

Arguments 

24 bit address to be written or read. 
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8 bit data byte. 
Return Values 

KReadFlashO returns the value of the location specified by address in the data 
parameter. 

Execution Time 

Both procedures take 4 cycles. 

The procedures are limited by the timing characteristics of the Flash RAM device. A 
read cycle takes at least 120ns, a write cycle 100ns. The procedures have been set up for 
aHandel-C clock of 25MHz. 

Description 

The macros read data from and write data to the address location specified in the 
address parameter. 

Note: These macros access the same signals and should NOT be called in parallel 
with each other. 

KSetFlashAddress(address) 

Arguments 

24 bit address value. 

Return Values 

None. 

Execution Time 
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This macro requires one clock cycle. 
Description 

The macro sets the Flash address bus to the value passed in the address parameter. This 
macro is used when a return value of the data at the specified location is not required, as 
may be the case when one FPGA is configuring the other with data from the Flash 
RAM since the configuration pins of the FPGAs are connected directly to the lower 8 
data lines of the Flash RAM. 

KReadFlashID(flash_component_ID, manufacturer ID) 
KReadFlashStatus(status) 

Arguments 

8 bit parameters to hold manufacturer, component and status information. 
Return Values 

The macros return the requested values in the parameters passed to it. 
Execution Time 

KReadFlashStatus() requires 10 cycles, 
KReadFlashID() requires 14 cycles. 

Description 

The macros retrieve component and status information from the Flash RAM. This is 
done by performing a series of writes and reads to the internal Flash RAM state 
machine. 
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Again, these macros are limited by the access time of the Flash RAM and the number of 
cycles required depends on rate the design is clocked at. These macros are designed to 
be used with a Handel-C clock rate of 25MHz or less. 

5 Although a system is in place for indicating to the CPLD that the Flash RAM is in use 
(by using the KSetFPGAFBMO and KReleaseFPGAFBMO macros) it is left up to the 
developers to devise a method of arbitration between the two FPGAs. As all the Flash 
RAM lines are shared between the FPGAs and there is no switching mechanism as in 
the shared RAM problems will arise if both FPGAs attempt to access the Flash RAM 

10 simultaneously. 

Note: These macros access the same signals and should NOT be called in parallel with 
each other. Also note that these macros provide a basic interface for communication 
with the Flash RAM. For more in-depth please refer to the Flash RAM datasheet. 

CPLD Interfacing 

The following are macros for reading and writing to the CPLD status and control 
registers: 

KReadCPLDStatus(status) 
KWriteCPLDControl(control) 

Arguments 

25 8 bit word 

Return Values 

KReadStatus() returns an 8 bit word containing the bits of the CPLD's status register. 
(Refer to the CPLD section for more information) 
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Execution Time 

Both macros require six clock cycles, at a Handel-C clock rate of 25MHz or less. 



Description 

These macros read the status register and write to the control register of the CPLD. 



KSetFPCOM(fp_command) 

Arguments 

3 bit word. 

Return Values 

None. 

Execution Time 

This macro requires three clock cycles, at a Handel-C clock rate of 25MHz or less. 
Description 

This macro is provided to make the sending of FPCOMMANDs to the CPLD easier. 
FP COMMANDS are used when the reconfiguration of one FPGA from the other is 
desired (refer to the CPLD section for more information). 

The different possible fp_command (s) are set forth in Table 32: 

Table 32 

FPSETJDLE Sets CPLD to idle 

FP READ_STATUS Read the status register of the CPLD 

FP WRITE CONTROL Write to the control register of the CPLD 
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FP_CCLK_LOW Set the configuration clock low 

FP_CCLK_HIGH Set the configuration clock high 

e-g- 

KSetFPCOM(FP_READ_STATUS); 
KSetFPCOM(FP_SET_IDLE); 

Note: These macros access the same signals and should NOT be called in parallel 
with each other. 

LEDs 

KSetLEDs(maskByte) 

Arguments 
8 bit word. 

Return Values 

None. 

Execution Time 

One clock cycle. 

Description 

This macro procedure has been provided for controlling the LEDs on the board. The 
maskByte parameter is applied to the LEDs on the board, with a 1 indicating to turn a 
light on and a 0 to turn it off. The MSB of maskByte corresponds to D12 and the LSB 
to D5 on the board. 
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Note: Only one of the FPGAs may access this function. If both attempt to do so 
the FPGAs will drive against each other and may 'latch-up', possibly damaging 
them. 

Using the Parallel Port 
Introduction 

The library parallel_port.h contains routines for accessing the parallel port. This 
implements a parallel port controller as an independent process, modeled closely on the 
parallel port interface found on an IBM PC. The controller allows simultaneous access 
to the control, status and data ports (as defined on an IBM PC) of the parallel interface. 
These ports are accessed by reading and writing to channels into the controller process. 
The reads and writes to these channels are encapsulated in other macro procedures to 
provide an intuitive API. 

Figure 11 shows a structure of a Parallel Port Data Transmission System 1100 
according to an embodiment of the present invention. An implementation of ESL's 
parallel data transfer protocol has also been provided, allowing data transfer over the 
parallel port, to and from a host computer 1102. This is implemented as a separate 
process which utilizes the parallel port controller layer to implement the protocol. Data 
can be transferred to and from the host by writing and reading from channels into this 
process. Again macro procedure abstractions are provided to make the API more 
intuitive. 

A host side application for data transfer under Windows95/98 and NT is provided. Data 
transfer speeds of around 100 Kbytes/s can be achieved over this interface, limited by 
the speed of the parallel port. 
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Accessing the parallel port directly. 

The 17 used pins of the port have been split into data, control and status ports 
in the IBM PC parallel port specification. See Table 33. 



Table 33 



Port Name 


Pin number | 


Data Port 




Data 0 


2 


Data 1 


3 


Data 2 


4 


Data 3 


5 


Data 4 


6 


Data 5 


7 


Data 6 


8 


Data 7 


9 






Status Port 




nACK 


10 


Busy 


11 


Paper Empty 


12 


Select 


13 


nError 


15 






Control Port 




nStrobe 


1 


nAutoFeed 


14 


Initialise Printer 


16 
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(Init) 




nSelectln 


17 



The parallel port controller process needs to be run in parallel with those part of the 
program wishing to access the parallel port. It is recommended that this is done using a 
par{} statement in the mainO procedure. 

The controller procedure is: 

parallel _port( pp_data_send_channel, 
ppjdatareadjchannel, 
ppjcontrol _portjread t 
ppstatus _port_read y 
pp_status j)ortjwrite); 

where the parameters are all channels through which the various ports can be accessed. 
Parallel Port Macros 

It is recommended that the following macros be used to access the parallel port rather 
than writing to the channels directly. 

Pp WriteData(byte) 
PpReadData(byte) 

Arguments 

Unsigned 8 bit word. 

Return Values 
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PpReadDataO returns the value of the data pins in the argument byte. 
Execution Time 

Both macros require one clock cycle. 
Description 

These write the argument byte to the register controlling the data pins of the port, or 
return the value of the data port within the argument byte respectively, with the MSB 
of the argument corresponding to data[7]. Whether or not the value is actually placed on 
the data pins depends on the direction settings of the data pins, controlled by bit 6 of the 
status register. 

PpReadControl( control _port) 

Arguments 

Unsigned 4 bit word. 

Return Values 

PpReadControl() returns the value of the control port pins in the argument byte. 
Execution Time 

This macro requires one clock cycle. 
Description 

This procedure returns the value of the control port. The 4 bit nibble is made up of 
[nSelect_in @ Init @ nAutofeed @ nStrobe], where nSelect_in is the MSB. 

PpReadStatus(status _port) 
PpSetStatus (status j>ort) 
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Arguments 

Unsigned 6 bit word. 

Return Values 

PpReadStatusO returns the value of the status port register in the argument byte. 
Execution Time 

This macro requires one clock cycle. 
Description 

These read and write to the status port. The 6 bit word passed to the macros is made up 
of [ppdirection @ busy @ nAck @ PE @ Select @ nError], where pp_direction 
indicates the direction of the data pins (i.e. whether they are in send [1] or receive [0] 
mode). It is important that this bit is set correctly before trying to write or read data 
from the port using PpWriteData() or PpReadDataO- 

Note: All of the ports may be accessed simultaneously, but only one operation 
may be performed on each at any given time. Calls dealing with a particular port 
should not be made in parallel with each other. 

Transferring data to and from the host PC 

The library parallel _port.h also contains routines for transferring data to and from a 
host PC using ESL's data transfer protocol. The data transfer process, pp_coms(), which 
implements the transfer protocol should to be run in parallel to the parallel port 
controller process, again preferably in the main par{} statement. A host side 
implementation of the protocol, ksend.exe, is provided also. 
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ppjcoms(pp_sendjchan, 
pp_recv_chan f 
pp_command t 
pp_error) 



- channel to write data to when sending 

- channel to read data from when receiving 

- channel to write commands to 

- channel to receive error messaged from. 



The following macros provide interfaces to the data transfer process: 



10 



OpenPP(error) - open the parallel port for data transfer 
ClosePP (error) - close the port 



Note: Make sure that the host side application, ksend.exe, is running. The macros 

'H 1 will try and handshake with the host and will block (or timeout) until a response is 

ry received. Also note that the following macros all access the same process and 

7 \ should NOT be called in parallel with each other. 

=F 15 

p Arguments 

\ ^ 

!T] Unsigned 2 bit word. 

lI Return Values 

20 The argument will return an error code indicating the success or failure of the 
command. 



Execution Time 

This macro requires one clock cycle. 
Description 

These two macros open and close the port for receiving or sending data. They initiate a 
handshaking procedure to start communications with the host computer. 
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SetSendMode(error) - set the port to send mode 

SetRecvMode(error) - set the port to receive mode 

Arguments 

Unsigned 2 bit word. 

Return Values 

The argument will return an error code indicating the success or failure of the 
command. 

Execution Time 

This macro requires one clock cycle. 
Description 

These set the direction of data transfer and the appropriate mode should be set before 
attempting to send or receive data over the port. 

SendPP(byte, error) - send a byte over the port 
ReadPP(byte, error) - read a byte from the port 

Arguments 

Unsigned 8 bit and unsigned 2 bit words. 
Return Values 

ReadPP() returns the 8 bit data value read from the host in the byte parameter. 

Both macros will return an error code indicating the success or failure of the command. 

Execution Time 
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How quickly these macros execute depend on the Host. The whole sequence of 
handshaking actions for each byte need to be completed before the next byte can be 
read or written. 

5 Description 

These two macros will send and receive a byte over the parallel port once this has been 
initialized and placed in the correct mode. 

The procedures return a two bit error code indicating the result of the operation. These 
10 codes are defined as: 

#define PP_NO_ERROR 0 
#define PPHOSTBUFFERNOTFINISHED 1 
#defme PP_OPEN_TIMEOUT 2 

15 

Note: SendPP and ReadPP will block the thread until a byte is transmitted or the 
timeout value is reached. If you need to do some processing while waiting for a 
communication use a 'prialt' statement to read from the global pp_recv_chan 
channel or write to the pp_send_chan channel. 

20 

Typical macro procedure calls during Read / Write 

Figure 12 is a flowchart that shows the typical series of procedure calls 1200 when 
receiving data. Figure 13 is a flow diagram depicting the typical series of procedure 
25 calls 1300 when transmitting data. 

The Ksend application 
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The ksend.exe application is designed to transfer data to and from the board FPGAs 
over the parallel port. It implements the ESL data transfer protocol. It is designed to 
communicate with the pp_coms() process running on the FPGA. This application is still 
in the development stage and may have a number of bugs in it. 

5 

Two versions of the program exist, one for Windows95/98 and one for WindowsNT. 
The NT version requires the GenPort driver to be installed. Refer to the GenPort 
documentation for details of how to do this. 

10 In its current for the ksend application is mainly intended for sending data to the board, 
D as is done in the esl boardtest program. It is how ever also able to accept output form 

SI the board. Again, please refer to the application note or the ksend help (invoked by 

~1 calling ksend without any parameters) for further details. 

1 y 

: ass 

SHI 

t-f : 

hi 

2 15 Serial Port 

xssz. 
%ssS 

^ Introduction 

m 

q Each FPGA has access to a RS232 port allowing it to be connected to a host PC. A 

^ 20 driver for transferring data to and from the FPGAs from over the serial port is contained 
in the file serial_port.h. 

RS232A Interface 

There are numerous ways of implementing RS232 interfacing, depending on the 
25 capabilities of the host and device and what cables are used. This interface is 

implemented for a cross wired null modem cable which doesn't require any hardware 
handshaking - the option of software flow control is provided, though this probably 
won't be necessary as the FPGA will be able to deal with the data at a much faster rate 
than the host PC can provide it. When soft flow control is used the host can stop and 
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start the FPGA transmitting data by sending the XON and XOFF tokens. This is only 
necessary when dealing with buffers that can fill up and either side needs to be notified. 

Serial port macros 

Serial port communications have been implemented as a separate process that runs in 
parallel to the processes that wish to send/ receive data. Figure 14 is a flow diagram 
illustrating several processes 1402, 1404 running in parallel. 

The serial port controller process is 

serial _port(sp_input, sp_output); 

where sp_input and sp_output are n bit channels through which data can be read or 
written out form the port. These reads and writes are again encapsulated in separate 
macro procedures to provide the user with a more intuitive API. 

SpReadData(byte) - read a data byte from the port 
SpWriteData(byte) - write a byte to the port 

Arguments 

n bit words, where n is the number of data bits specified. 
Return Values 

SpReadData() returns an n bit value corresponding to the transmitted byte in the 
argument. 

Execution Time 

The execution time depends to the protocol and the baud rate being used. 
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Description 

These procedures send and receive data over the serial port using the RS232 protocol. 
The exact communications protocol must be set up using a series of #defines before 
including the serial_port.h library. To use an 8 data bit, 1 start and 1 stop bit protocol at 
1 15200 baud on a null modem cable with no flow control the settings would be: 

#define BAUD_RATE 115200 
#define START_BIT ((unsigned 1)0) 

#define STOP_BIT ((unsigned 1)1) 

#define NUM_D ATA_B ITS 8 



Other options are: 

For soft flow control: 

#define SOFTFLOW 
#define XON 
#define XOFF 



<ASCII CHARACTER CODE> 
<ASCII CHARACTER CODE> 



RTS/CTS flow control: 

#define HARDFLOW 



The default settings are: 

Baud rate 9600 

Start bit 0 

Stop bit 1 

Num. data bits 8 

XON 17 

XOFF 19 
Flow control off 
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5 



Any of the standard baud rate settings will work provided that the Handel-C clock rate 
is at least 8 times higher than the baud rate. Also ensure that the macro CLOCK_RATE 
is defined, this is generally found in the pin definition header for each of the FPGAs. 

e.g. 

#define CLOCK_RATE 25000000 // define the 
clock rate 



10 Example Program 

M3 Shown here is an example Handel-C program that illustrates how to use the parallel and 

sa serial port routines found in the serial_port.h and parallel_porth libraries. The program 

? ? i implements a simple echo server on the serial and parallel ports. The SetLEDsO 

W 15 function from the klib.h library is used to display the ASCII value received over the 

E 

2 serial port on the LEDs in binary. 

py // Include the necessary header files 

"~ # define MASTER 

20 #ifdef MASTER 

#include "KompressorMaster .h" 
#else 

#include "KompressorSlave . h" 



25 



#endif 

#include "stdlib.h" 
#include "parallel_port .h" 
# include "klib.h" 



30 // Define the protocol and include the file 
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#define BAUD_RATE 9600 
#define NUM_DATA_B ITS 8 
#define NULLMODEM 
#include "serialjoort .h" 

II III 1 1! 1 1! Ill III II I II i II 1 1 1 i ' ! 1 11 

II Process to echo any data received by the parall 
port 

//to verify it is working properly 

macro proc EchoPPO 
{ 

unsigned 8 pp_data_in; 

unsigned 2 error with {warn = 0}; 

unsigned 1 done; 

OpenPP (error) ; // initiate contact with host 
while ( Idone) 
{ 

// read a byte 
SetRecvMode (error) ; 
ReadPP (pp_data_in, error); 

// echo it 
SetSendMode (error) ; 
WritePP(pp_data_in, error); 

} 

ClosePP (error) ; // close connection 

} 
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////////////////////////////////// 

// Process to echo any data received by the serial 
port 

// to verify it is working properly. We are always 
// listening on the serial port so there is no need 
to open it . 

macro proc EchoSPO 

{ 

unsigned 8 serial_in_data; 

while (1) 
{ 

SpReadData (serial__in_data) ; // read a byte 
from the serial port 

SetLEDs (serial_in_data) ; 

SpWriteData (serial_in_data) ; // write it 

back out 

} 

delay; // avoid combinational cycles 

} 

void main (void) 

{ 

while (1) 
{ 

par 

{ 

EchoPPO; //Parallel port thread 
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EchoSPO; // Serial port thread 



Ui 

z : 
= 5 : 



1 1 1 1 1 1 Start the services 1 1 1 1 1 1 1 1 
II Parallel Port stuff 
5 pp_coms (pp_send_chan, pp_recv_chan, 

pp_command, pp_error) ; 

parallel jport (pp_data_send_channel , 
pp_data_read_channel , 
10 pp_control_port_read, 

pp_status_jport_read, pp_status_port_write) ; 

// Serial port stuff // 

serial jport (sp_input , sp_output) ; 



15 



} 



20 The code can be compiled for either FPGA by simple defining or un-defming the 
MASTER macro - lines 1 to 5 



More Information 

25 Useful information pertaining to the subjects of this described herein can be found in 
the following: The Programmable Logic Data Book, Xilinx 1996; Handel-C 
Preprocessor Reference Manual, Handel-C Compiler Reference Manual, and Handel-C 
Language Reference Manual, Embedded Solutions Limited 1998; and Xilinx Datasheets 
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and Application notes, available from the Xilinx website http://www.xilinx.com. and 
which are herein incorporated by reference. 

Illustrative Embodiment 

According to an embodiment of the present invention, a device encapsulates the 
Creative MP3 encoder engine in to an FPGA device. Figure 15 is a block diagram of an 
FPGA device 1500 according to an exemplary embodiment of the present invention. 
The purpose of the device is to stream audio data directly from a CD 1502 or CDRW 
into the FPGA, compress the data, and push the data to a USB host 1504 which delivers 
it to the OASIS(Nomad 2) decoder. The entire operation of this device is independent 
of a PC. 

The design of the FPGA uses the "Handel-C" compiler, described above, from 
Embedded Solutions Limited (ESL). The EDA tool provided by ESL is intended to 
rapidly deploy and modify software algorithms through the use of FPGAs without the 
need to redevelop silicon. Therefore the ESL tools can be utilized as an alternative to 
silicon development and can be used in a broader range of products. 

Feature Overview 

The FGPA preferably contains the necessary logic for the following: 

- MP3 Encoder 1506 

- User Command Look Up Table 

- play 

- pause 

- eject 

- stop 

- skip song (forward / reverse) 
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- scan song (forward / reverse) 

- record (rip to MP3) -> OASIS Unit 

- ATAPI 

- command and control 

- command FIFO 

- data bus 

- command bus 

- (2) 64 sample FIFOs (16bit * 44.100 kHz) 

- Serial Port (1 6550 UART) optionally EEPROM Interface (I2C & I2S) 

- USB Interface to host controller 

- SDRAM controller 

- 32-bit ARM or RISC processor 

In addition to the FPGA the following is preferably provided: 

- USB Host / Hub controller (2 USB ports) 

- 4MB SDRAM 

- 1 2 8K EEPROM 

- 9-pin serial port 

- 6 control buttons. 

- 40-Pin IDE Interface for CD or CDRW 
Interfaces 

ATAPI (IDE) Interface 
User Interface 
USB Interface 
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Network-Based Configuration 

Figure 16 illustrates a process 1600 for network-based configuration of a programmable 
logic device. In operation 1602, a default application is initiated on a programmable 
logic device. In operation 1604, a file request for configuration data from the logic 
device is sent to a server located remotely from the logic device utilizing a network. 
The configuration data is received from the network server in operation 1606, and can 
be in the form of a bitfile for example. In operation 1608, the configuration data is used 
to configure the logic device to run a second application. The second application is run 
on the logic device in operation 1610. 

According to one embodiment of the present invention, the logic device includes one or 
more Field Programmable Gate Arrays (FPGAs). Preferably, a first FPGA receives the 
configuration data and uses that data to configure a second FPGA. The first and second 
FPGAs can be clocked at different speeds. 

According to another embodiment of the present invention, the default application and 
the second application are both able to run simultaneously on the logic device. The 
logic device can further include a display screen, a touch screen, an audio chip, an 
Ethernet device, a parallel port, a serial port, a RAM bank, a non-volatile memory, 
and/or other hardware components. 

Figure 17 illustrates a process 1700 for remote altering of a configuration of a hardware 
device. A hardware device is accessed in operation 1702 utilizing a network such as the 
Internet, where the hardware device is configured in reconfigurable logic. In operation 
1704, a current configuration of the hardware device is detected prior to selecting 
reconfiguration information. Reconfiguration information is selected in operation 1706, 
and in operation 1708, is sent to the hardware device. In operation 1710, the 
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reconfiguration information is used to reprogram the reconfigurable logic of the 
hardware device for altering a configuration of the hardware device. 

The reconfiguration of the hardware device can be performed in response to a request 
received from the hardware device. In an embodiment of the present invention, the 
hardware device is accessed by a system of a manufacturer of the hardware device, a 
vendor of the hardware device, and/or an administrator of the hardware device. 

In another embodiment of the present invention, the logic device includes at least one 
Field Programmable Gate Array (FPGA). Preferably, a first FPGA receives the 
reconfiguration information and uses the reconfiguration information for configuring a 
second FPGA. 

Illustrative Embodiment 

Figure 18 illustrates a process 1800 for processing data and controlling peripheral 
hardware. In operation 1802, a first Field Programmable Gate Array (FPGA) of a 
reconfigurable logic device is initiated. The first FPGA is configured with 
programming functionality for programming a second FPGA of the logic device in 
accordance with reconfiguration data. The reconfiguration data for configuring the 
second FPGA is retrieved in operation 1804. In operation 1806, the first FPGA is 
instructed to utilize the reconfiguration data to program the second FPGA to run an 
application. In operation 1808, the first FPGA is instructed to user the reconfiguration 
data to program the second FPGA to control peripheral hardware incident to running the 
application. 

In one embodiment of the present invention, data stored in nonvolatile memory is 
utilized for configuring the first FPGA with the programming functionality upon 
initiation of the first FPGA. In another embodiment of the present invention, the 
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configuration data is retrieved from a server located remotely from the logic device 
utilizing a network. The configuration data can be received in the form of a bitfile. 

The first and second FPGA's can be clocked at different speeds. Preferably, the logic 
device also includes a display screen, a touch screen, an audio chip, an Ethernet device, 
a parallel port, a serial port, a RAM bank, and/or a non-volatile memory. 

Further Embodiments and Equivalents 

While various embodiments have been described above, it should be understood that 
they have been presented by way of example only, and not limitation. Thus, the breadth 
and scope of a preferred embodiment should not be limited by any of the above 
described exemplary embodiments, but should be defined only in accordance with the 
following claims and their equivalents. 
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